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37 CT.It SL74(d)/(e> Copyright Notice 
A portion of the disclosure of this patent document contains materia) which is 
10 subject to cop)rrigbt protection. The copyright owner does not object to the 
reproduction by anyone of the patent document or the patent disclosure, as it 
appears in the Patent and Trademark Office patent files or records, but 
otherwise reserves all copyright rights whatsoever. 

15 RRT^TKn APPUCATTONR 

This patent document ia a Continuation-In*Pan of U.S. Serial No. 07/977.691 
filed November 13. 1992, pending. This patent document is related to 
THESAPEtmC APPLICATION OF CHIMESIC ANTIBODY TO HUMAN B 
LYMPHOCYTE BESTRIC1SD DIFFEXtENTIATION ANTIGEN FOR 
20 TREATMENT OF B CELL LYMPHOMA," having UOS. Serial No. 07/978.891. 
filed November 13. 1992 (pending), and THERAPEUTIC APPLICATION OF 
CHIMERIC AND RADIOLABLED ANTIBODIES TO HUMAN B 
LYMPHOCYTE RESTRICTED DIFFERENTIATION ANTIGEN FOR 

TREATMENT OP B CELL LYMPHOMA", having UA Serial No. . filed 

25 simultaneously herewiUi. TUs patmt document is rdated to commonly assigned 
United States Serial No. 07/912^92 and entitled "RECOMBINANT 
ANTIBODIES FOR HUMAN THERAPY." filed July 10, 1992. These documents 
are incorporated herein by reference. 



wo 94/1 PCT/US93/1 mt 



MISSING PAGE 



wo 94/11533 PCr/US93/tmi 
AS noted, the advent of the biotechnolosy industry has allowed for the 
production of iai^e quastiti s of proteins. Proteins are the essential constituents 
of all living eells and piroteins are coiwrised of combinations of 20 naturally 
occurring amino adds; each amino add molecule is de5ned Cencoded**} by 
5 groupxnga reodons") of three deoxyxibonudeie add TDNA") moleeules: a string of 
DNA molecules ("DNA macromolecule") provides, in essence, a blueprint for the 
production of spediic sequences of amino adds specified by that blueprint. 
Intimately involved in this process is ribonudeic add ("RNA"); three types of 
RNA (messenger RNA; transfer RNA; and ribosomal RNA) convert the 
10 information encoded by the DNA into, tg a protein. Thus, genetic information is 
generally transferred as follows: DNA ^ RNA -» protein. 

In accordanee with a typical strategy involving recombinant DNA technology, a 
DNA sequence which encodes a desired protein material reONA") is identified 

15 and either isolated fram a natural source or qnithetically produced By 

manipulating this piece of genetie material, the ends thereof are tailored to be 
ligated. or "fit," into a section of a amaQ circular molecule of double stranded 
DNA. This circular molecule is typically refimed to as a DNA expression 
vector,** or simply a ** vector.** The combioatiea of the vector and the genedc 

20 material can be referred to as a "plasmid* and the plasmid can be replicated in a 
prokaryotichost(is bacterial in nature) as an autonomeus circular DNA 
molecule as the prokaiyodc host replicates. Thereafter, the dreular DNA 
plasmid can be isolated and introdueed into a eukaiyotic host kit mammalian in 
nature) and host cells whtcb have incorporated the plasmid DNA are selected. 

25 While some plasmid vectors will repHcate as an autonomous dreular DNA 
molecule in Tr^twmaiUfi cells, itg plasmids comprising Epstein Barr virus 
CEBV*) and Bovine Papilloma virus TBPV^ based vectors), moat plasmids 
induding DNA vectors, and aU plasmids induding RNA retroviral vectors, are 
integraud into the cellular DNA sudi that when the cellular DNA of the 
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eukaryotic host cell replicates, the plasmid DNA will also replicate. 
Accordingly, as eukaryotic cells grow and divide, there is a corresponding 
increase in cells containing the integrated plasmid which leads to the production 
("exprsBsion") of the orotain mafiaiial of interest. By suhiacting tha hoai cells 
5 containing the plaamid to fiivonble growth condidona, significant amounts of the 
host, and hence the protdn of interest, are produced. Typically* the Chinese 
Hamster Ovary ("CHO") cell line is utilised as a enkaryotie host cell, while E. co/i 
is utilised as a prokaryotie host cell. 

20 The vector plays a cnidal role in tha foregoing - manipulation of the vector can 
allow for variaUUty as to where the cDNA is inserted, means for determining 
whether the cDNA was, in fact, properly inserted within the Teetor, the 
conditions under which expression of the genetic material wiU or will not occur, 
etc. However, most of the vector manipulations are geared toward a sini^e 

15 goal— increasing expreaaion of a desired gene product, te protein of interest. 
Stated again, most vector manipulation is conducted so that an ^improved" 
vector will allow for production of a gene product at significantly higher levels 
whn compared to a "non>improved" vector. Thus, while certain of the 
features/aspeets/characteristics of one vector may appear to be similar to the 

20 features/aspeets/charaeteriatics of another vector, it is often necessary to 

examine the result of the overall goal of the manipulation • improved produaion 
of a gene product of interest 

While one "improved* vector may comjprise charaeteriatics which are desirable 
25 for one set of circumstances, these characteriatica may not necessarily be 

desirable under other dreumstances. However, one characteristic is desirable for 
all vectors: increased efficiency, ie the ability to increase the amount of protein 
of interest produced while at the same time decreasing the number of host cells 
to be screened which do not generate a sufficient amount of thie protein. Such 
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increase effidmcy would have stvcral desinble advancageSt including reducing 
manufacturing coats and decrea^ng the t&mo spent by technicians in screening 
for viable colonies which are expressing the protdn of interest. Accordingly, 
what would be iBsMtit and what would significantly improve the state of the 
5 art are expression vectors with such efficiency characteiistics. 



SimMARrOFTHEBimniQlL 

The invention disdosed herein satisfies these and other needs. Disclosed herein 
10 are fully impaired consensus Kotak sequences which are most typically used 
with dominant selectable markers of transcriptional cassettes which are a part of 
an expression vector; preferably, the dominant selectable marker comprises 
either a natural intronie insertion region or artificial intronic insertion region, 
and at least one gene product of interest is encoded by ONA located erithin such 
15 insertion region. 



As used herein, a ''dominant selectable marker** is a gene sequent or protein 
encoded by that gene sequence; expression of the protein encoded by the 
dominant selectable marker assures that a host cell transfected with an 

20 expression vector which indudes the dominant selectable marker will survive a 
selection process which would otherwise kill a host cell not c ont ai n i n g this 
protein. As used hermn, a **transcriptional cassette** is DNA encoding for a 
protein product [eg a dominant selectable marker) and the genetic elements 
necessary for production of the protein product in a host cell (is promoter; 

25 transcription start site; polyadenylation region; etc). These vectors are most 
preferably utilized in the expression of proteins in mammaliBn expression 
systema where integration of the vector into host ceUular DNA occurs. 
Benefidaily, the use of such fully impaired consensus Kotak sequences improves 
the efEdency of protein expression by significantly decreasing the number of 
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viable coloniOT whil at the tame tune. Mgnificantly inereBsia* the amount ot 

protein expressed by such viable coloni s. As used hereto, a 'natural intronic 

insertion region" is a region of DMA naturally present within a g«ia in this ease. 

typically a donunant selectable aaiker. can ba utilizad &r insertion of 
5 DNA encoding a ceBap»aduetofiataiest;aB'artlftdaltot««leiMartlon region 

is 8 region of DNA which is selactivdy created in a geae (again, most typically, a 
dominant seieetabia marker) which can be utilised tor insertion of DNA encoding 
a g«»e product of interest. Infonnatlon regarding iatronie positioning is 
described in Abrams. J.M. «t aL "Intronic Pttsitfoaiaf MaiimiM* Co^imion 
10 and Oo-ampUfication of Nonselaetable Hetarelogeus Genes. V. Bfa Clum., 
2««4:U016 <1989). and UA Patent No. 5.043.270 (both deenaanta being 
ineerporatad herein by reference). 

As defined, disdosad and cWmad herein, a "fiittyiapaiiwi consensus Kosak" 
IS conprisesthafoUowittrseQuenee: 

-3 ♦! 
Pyxx ATG Pyxx 



20 



25 



30 



where: "x" is a nudeotsde eelacted from the groop consisting of adenine (A), 
quanine (G). cytosiae (C) or thyniaa (TV oiaefl (U); Ty" •» a pjntimidine 
nucleotide, te CarTAJ;"ATO"i8aeodonancodiagfer.thaaniinoacid 
methionine, the so^ed "start" eedan; and ;3 and ♦! a» diwctional reference 
peinU vis-8'ifia ATO. -3 is meant to indicate three nudaotidaa apstream. of 
ATG and +1 ia meant to indicate one nudeetida downstream of ATO. 



Preferably, the fully impaifad eenseasus Kosak U part of a matUonina start 
codon that initfatat translation of a dominant selectable marker portion of a 
transcriptional cassette which is part of an espresrion vector. Preferred 
dominant salectaMa markers iadode. but eie not limited to: herpes simplex 
I thymidine kiaaM; adenosiae deaninasr. aeparagiae synthetase; 



virus I 
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Saimonelta his D g«M: santhiae guaaine ph«sphonb tyl trans&rafle; 
hygTomycmBphesphotransfemetaadfiMmydaphosphotraasfem Most 
preferaUy. the dominant selectable flttricer is neomycin phoe^trans&rase. 

5 In partieulariy preferred embodiments of the invention^ at least one eut-of-frame 
start codon lie ATG) is located upstream of the fully impaired consensus Kozak 
start codon. without an in-frame stop codon being located between the upstream 
start codon and the fully impaired consensus Kozak start codon. As used herein, 
the term "stop codon" is meant to indicate a codon which does not encode an 
10 amino add such that translation of the encoded material is terminated; this 
definition includes* in particular, the traditional stop codons TAA, TAG and 
TGA. As used herein, the tenns "in-frame" axid "ou^of-frame" are relative to the 
fully impaired eonaenauaXoxak start codoiv By way of example, in the following 
sequence: 

15 

-3 ♦X 

GAC CKT^C rirv ATfi gifX 

the underlined portion of the sequence is representative of a fiiUy impaired 
20 consensus Kotak(wher«VrBpra8antB a nucleotide) and the codonaGAa CAT 
and GCC an "in-frame" codons relative to the ATG atart eedon. The above-lined 
nucleotides represent an "out-ef-frame* start eodon wUdi is upstream of the 
fuUy impaired eensansu Kosak start codon. Pre&rably, the out-of-frame start 
codon is within about 1000 nudeotides upstream of «he ftdly impaired consensus 
25 Kozak start codon, mora praferaUy within about 350 nudeotides upstream of the 
fUUy impaired eonsanaoa Kbsak start codon, and most preferably within about 50 
nudeotides upstrsam of the folly impaired consenans Kssak start oodon. 
Preferably, the out-of'firamestaxt codon is a part of a consensus Kosak. Byway 
of example, the seqnanca set forth above satisfies this criteria: the nudeotide 

'7- 



10 



wowitsa 

i, a pttrin. <G); nudMtid. -6. -7 i»d -8 encode «i eut- tfraae stan codon (ATG v 
and nodeotide •!! is a purine <A). 

AdditioaaQy. tttiBiation of a Wly impaired cenaensus Koiak within a Mcoodary 
structun (k a ao^allad ".tem-loap- or "hairpin") ie b«»ftdaUy viable to 
impairment of traiiaUUon of the protein encoded by the dominant selectable 
number, to such an embodiaent. the start codon of the fully impaired consensus 
Kozak is most preferably located within the eUm of a atam loop. 

Particulariy preferred expr««ion vectors which incorporate these aspects of the 
invention disdosed herein are referred to as "TCAE." and "ANEX" and 
"NEOSPLA" vectors; particulariy preferred vectors are referred to as ANEX 1. 
ANEX 2 and NEOSPLA3P. 

15 nieseandotheraepectacfthaiftveirtlondisdosedherdnwinbedeUiiaatedin 
further detail in the aectinu to follow. 

20 Figure 1 provide* the relevant portion of mconsanrnttKoMk and aaveral 
particulariy pirfarred fUly impaired eanaeaaoe Kiisak seqacnces; 

Figure 2 p«vidaa ediagrmmatoraptmsantationef the vectora TCAE 5.2 and 
ANEX 1 (TCAE 12) derigned fi» ttpresaion of mauseftnmiM chimeric 

25 inununoglobuHB.^<*«eth8immuiiogIblwitogBn^ 

coDfiguratioD using neomydn phosphatemsfeiwe aa the dominant adectaWe 
marker. 



•8- 
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Figure 3 U a bsatogram eomparing protein expression levels with the vectors 
TCAE6.2aadANEXl; 

Figure 4 provides e dsagramaatie represeatetioa of tlie vector ANSX 2 designed 
5 for expression of mouse/human chimeric immunoglobxilin, where the 

immunoglobulin genes are arranges in a tandem configuratipn using neomycin 

phosphotransferase as the dominant selectable marker; 

Figure 5 is a histogram compaiing protein expression levels with the vectors 
10 TCAE &2. ANEX 1 and AKBX 2; 

Figure 6 provides a diagrammatic representation of a NEOSPLA vector designed 
for expression of mouse/human chimeric immuBoglobuUn; and 

15 Figures 7A, 7B and 7C are histograms comparing protein expression levels with 
the vectors TCAE 5.2 vs. NEOSPLA 3F (7A). AKEX 2 vs. NE0SFLA3F (7B); and 
GK-NE0SPLA3F vs. NE0SPLA3F (70). 

20 

Disclosed herein are nucleic add sequence amngemente which impair 
translation and initiation of» most preferably, dominant eelectable maricers 
incorporated into mtmwnnHan expressian vactoiB bdA wbidi are preGBrafaly, but 
not necessarily, co^finked to an enrfwting sequence for a gene product of interest. 
25 Preferably, the dominant seleetaUe marker comprises at least one natural or 
artificial intronic inaeitien region, and at least one gene product of intereet is 
encoded by DNA located within at least one such intron. Such arrangements 
have the effect of increasing expression efficiency of the gene product of interest 
by. inter aiia. decreasing the number of viable coleaies obtained from an 

•9- 
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equivaitnt amount f plasimd DNA traasftetcd per cell, vdnle mcreasiiic th« 
amount of gcna product exprasscd in M«b eleoe. 



P«r punwM of bravity aad preaantatioMl aOatacy. th* foeua of Ma* ««ction of 
S the pattntdiadoaurawm be pttodpallydirtetrt to aJiwdae dominant »J«a.blo 

marker, neomycin phowhotranefense. which it ineofponted into a 
mouBa/humanehinwrieimmuno^^ulinexpneaion vector. Itiatobe 
wderatood. however, that the invention disclosed henin is not intended, nor U it 
to be construed, as limited to these particalarsyttems. To the contrary, the 
10 disclosed invention is nppUeaUe to mammalian o«pieaaioBiyatems in toto. 

where vector DNA is integrated into host edlalar DNA. 

One of the most preferred medio^ oliliad by those in tha an for predudnff a 
mammalian ceU line that piodneea a high levd of a ptoteia(te 'production cell 

15 line") involves random integration of DNA coding fcr tha desired gene product (ie 
'exogenous DNA") by using, most ty^caUy. a drug resistant gene, referred to as 
a "dominant seieetaUe marinr.' ftat allowa for selection of eella that have 
integrated the exogenous DNA. &atad again, those edls wfaidi pn^erly 
incorporate the exogenous DNA induding. « the drug resistant gene, will 

20 maintain resistance to the corresponding drug. TWs is niOst typically fdUowed by 
co-amplifieation of the DNA encoding for the desired gens product in the 
transfeeted edi br ampli&ii« an adjacent gsne that also encodes for drug 
resistance ("ampliileation gene-X eg lesistaaee to methotrexate (MT» in the case 
of dehydrefolata reductase ®HFR) gene. The ampKfieation gene can be the 

25 same as the dominant selectable mariter gene, or it can be a separate gene. (As 
those in tha art appreciate, "transfeetion" is typically utilised to describe the 
process or state of introduction of a plasmidinto a mammalian host ceU. whUe 
"tranaformatiak'' is ^rpicatty utilized to describe the process or state of 
introduction of a plasnddinto a bacterial host cell). 

•10- 
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Two amplificatioii approacfata are typically ompl yad by thaae in th« art. In the 
Erst the entire pepulatioii of traaafected and dm; restetant edle (aech cell 
comprising at leadt exie integration of the gene encoding for drug resistance) is 
amplified; in the second, individual clones derived from a eini^ cell are 

amplified. Each approach has unique advantages. 

With respect to the first approach, it is somewhat "easier" to amplify the entire 
population (typically referred to as a "shotgun" approach, an apt description) 
compared to individual clones. This is because ampHScation of individual clones 
initially involves* inter alia, screening of hundreds of isolated mammalian 
colonies <each derived from a sin^e cell, most of which being single copy 
integrants of the expression plasmid) in an effort to isolate the one or two "grail" 
colonies which secrete the desired gene product at a "U^" level, is at a level 
which is (typically) three orders of magnitude fainter than the lowest detectable 
expression level. Thaae cells are also often fbund to have only a sin^e copy 
integration of the expression plasmid. Additionally, amplifying individual dones 
results in production ceS lines which eeutain fbwer oopies of the amplified gene 
as compared to amplification of all transfteted cells (typically, 10-20 versus 500- 
1000). 

Mth respect to the aecond approach, prodactien eeU lines derived fium 
amplifying individual donea are typicalfy derived in lower levels of the drug(s) 
used to select for those edonies which comprise the gene for drugCs) resistance 
and the mgenotts fane product (is in the case of aiethotrexata and DHFR, SnM 
versus ImMX Furthemoro. individual dones can tyincally be iadated in a 
shorter period of time (3«6 aaonths versus 6-9 months! 

-a. 
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Idtally then, tha uagible benefiu of both approacfaoo ahouid be mergod: ai a 
pmetical levaL chia would involve decreasing the number of e lonies to be . 
3gr^n#d. w*^ inm^ifing thf ««oiint ptoduet aecreted by these coioniea. The 
present invention accomplishes this task. 

Tlxe position where the DNA of the dominant selectable mariier of the plasmid 
DMA is integrated within the cellular DNA of the host cell determines the level 
of expression of the dominant selecuble marker protein, as is recogniied by 
thoseinthaart It is assumed that the exprt^ionofa gene encoding a protein of 
interest which is either co-linked to or positioned near the dominant selectable 
marker DNA is proportional to the expression of the dominant selecuble marker 
protein. While not wishing to be bound by any particular theory, the inventor 
has postulated that if the gene used to select for the integration of the exogenous 
DNA in the mammaHan cell {ie the dominant selectable marker) was designed 
such that translation of that dominant selectable marker was impaired, then 
only those plasndds which could overcome audi impaiimant by over-production 
of the gene product of the dominant selectable markerwould survive,^, the 
drug-screening process. By associating the exogenous DNA with the dominant 
selectable marker, then, o /lofti, over-production of the gene product of the 
dominant selectable marker would also result in over-production of the gene 
product derived from the assodatad exogenous DNA. Ill aoeordance with this 
postulated appioadi. impainse&t of translatioo of the dominant selectable 
marker gene would be aeeasaary. and an avenue for aueh impairment was the 
eonsenstis Xoxak portion of the gene. 

By compaiing several kuncbed vertebrate mRNAs. Marilyn Kosak in "Possible 
role of Banking nucleotides in recognition of the AUG initiator oodon by 
eukaryotie ribosomee.- Nuc Adda Res. 9: 52»5252 (1981) and XoapUation and 
analysis of sequences upstream from the traoslational start site in eukaryotie 
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oBNAa." Sue. Adds Res. 12: 857-872 {X984), proposed the foUowUig 'eonsensu* 
aequMiea for initiation of tmislatioa in higher eufcanroias: 



-3 *l 
5 ee Aec UlU G 

(AS those in the aitawwdate, uraell. U. fcplacM the d««y«idBOtide thymine. 
T.inRNA.) In thbeecioenee. referred to oee-eonsensusKoMk- the most 
highly conseived nudeotidee are the porines, A and O. shown in capital letters 

10 abo»a;mutttionalaa«lj8l»«»»finnedthattheaeti«)podtionsto^ 

innuaneeoninitiatioo. s«;«g.ItoMk.M.-Efltetsofinterti8tr0nic length on th. 
efEdencycf ..Initiation of«nkaryoticriboson«.- Mot Ottflto. 7/10:3438^5 
(1987). Kosakfuither datenained that alteraliona in the sequence upstream of 
the coBaensoa Konk «Hi effect translation. For example, in Tnfluences of 

15 mRNA secondary atmctur. on initiation by eukaryotie ribosomes " PNAS 83: 
2850.2854 (1986) Koak dawsibes the "aitifieial- introduction of a secondary 
hairpin struebire region upstream ftom tha consensus Xbsak in several plaamids 
that encoded prepndasulin: it was en>«i«entally determined that a stable stem 
loop structure inhibited translation of the prepwinsulin gene, reducing the yield 

20 of preinsuUn hy 85>95«. 

SurpiisiBgly.ttwa»discover«lbyth«inventorthatby changing the purines A(-3 
vis^n-via ATO start codon) and G (+1) to pyrinridines. translation impairment 
was significant: when the consensus Koxak for Uie neomycin phDsphotransferase 

25 genawa88ubjectedtosuchalterationa(aswinbesetforthindetailbdow).the 
number of 0418 rwistant cdoniea significantly decreased: however. th«« was a 
rignifieant increase in the amount of gene product CKpressed by the individual 
G418 resistant dones. As those in the art wiB rscogmw. this has the effect of 
increasing the effideney of the espression system-there are less colonies to 

30 screen, and most ofthe colonies that are tiaWepreduoedgnifieantly more 

•U- 
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product than would ordinarily bo obtaisod. Confirmation of the inventor's 
postulated theory waa thus ezperiiuentally detenmned 



A€ noted* for purposes ofthis patent document a. "consensus Kosak" comprises 
5 the following soquanoe: 

-3 

Prx ATG Pxx 

10 a "partially impaired consensus Kosak' comprises the following sequence 

-3 *1 
P/Pyxx ATG P/Pyxx 



15 and a disclosed and claimed "fully impaired consenaus Kesak** comprises the 
following sequence: 

-3 

Pyxx ATG Pyxx 

20 

where: "x" is a nudeotide selected from the group consisting of adenine (A), 
guanine (O). cytosine (C) or thymine CD (uracfl, U. is the case of RNA); is a 
purine,ie A or G. Ty" is a pyridine, i« Cor TAJ; ATG i» a conventional start 
codoa which encodes for the amino acid methiooine (Met); the numerical 

25 designations are relative to the ATO eedon, U a negative number indicates 
"upstreaffi" of ATG and a positive number indicates "downstream* of ATG; and 
for the partially impaired coaseasus Kuak, the following proviso is appUcable- 
only one of the -3 or ♦! nucleotides is a pyridine, eg. if -3 is a pyridine, then +1 
must be a purine or if*3 is a purine, then 4*1 must be a pyridine. Most 

30 preferably, the fully impaired consensus Koiak is associated with the site of 
translation initiation of a dominant selectable marker whi A is preferably (but 
not necessarily) co-linked to exogenous DNA which encodes for a gene product of 
interest. As used herein, ^'nucleotide' is meant to encompass natural and 
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synthetic deosy* and liboaudeotides as wall as modifiad daoxy- and 
ribonucleotides. U where the 3* OH. 5'OR. sugar and^or hetarocydie base are 
modified, as weO as modification of the phosphate backbone, eg methyl 
phosphates, pbosphorothsoates and phosphoramidites. 

Information reifarding the gene sequence of the dominant selectable marker is 
preferaUy known; howaver. in lieu of the entire sequence, information regarding 
the nudde add sequence (or amino add sequence) at the site of translation 
inidatlon of the dominant selectable marker must be known* Stated againt in 
order to effectuate a diange in the consensus or partially impoired consensus 
Kozak. one must know tha sequoice thereof. Changing dsaensensus or 
partially in^aired consensus Xbzak to a fully impaired consensus Kozak 
sequence can be accomplished by a variety of approaches well known to those in 
Che art induding, but not limited to, site specific mutagenesis and mutation by 
primer«based amplification {eg PGR); most preferably, such change is 
accomplished via mutation by primer-based amplification. This preference is 
prindpally based upon the comparative "ease* in accomplishing the task, coupled 
with the efficacy associated therewithu For ease of presentation, a description of 
the most preferred means for accomplishing the change to a fully impaired 
consensus Kozak will be provided. 

In essence, mutation iy primer-based amplification relies upon the power of the 
amplification process itself-^as PCR is routinely utilised* fi>cu8 will be directed 
thereto. However, other primer-based amplification tedmiques (c;g Hgase chain 
reaction, etc) are applicable. One of the two PCR primers fmutational primer**) 
incorporates a sequenca wbscb will ensure that the resulting amplified DNA 
product will incorporate the fully impaired consensus Kozak within the 
transcriptional cassette incorporating the dominant selectable marker of 
interest; the other PCR primer is complementary to another region of the 

•15- 
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donanant selectable marker; a transchptioiial eassetta uicorporating the 
dominant selectable masker; or a vector which cominis s the transcriptionai 
cassette. By way of example, the complement to a dominant selectable marker 
which indudeo n censensue Xaaak could have the GiUowinff seQuenca (SEQ ID 

• S MO: n 

3 ' -ta^ccaggTccTACCcc-S ' 

In order to create a fiiUy impaired consensus Kosak, the mvtataone] primer could 
20 have the following sequence (SEQ ID NO: 2) (for convenienca, SEQ ID NO: 1 is 
placed over the mutational primer for comparative purposes): 

5 • -are9arcet99AT6C99*3 • 
3 ■ -tagetaggTccTACCec-S ' 

15 

As is evi<tent, complementarity is lacking in the primer [set • the symbolsX By 
utilizing excess mutational pnmer in the FOR reaction, when the sequence 
including the consensus Kozak is amplified, the resulting amplified DNA 
20 products will incorporate the mutations such that as the amplified DNA products 
are in turn amplified, the mutations will predominate sudi that a fiUly impaired 
consensus Kocak will be incorporated into the amplification product. 

Two criteria are required for the mutational primer^first, the length thereof 
25 mustbesufSdent such that hybridizatioa to the target will result As will be 
appreciated, the matational primer wiS not be 100% complementary to the 
target. Thus» a sufficient number of com]dementary bases are required in order 
to ensure the requisite hybridization. PreforaUy. the length of the mutational 
primer is between about IS and about 60 nucleotides, more preferably between 
30 about 18 and about 40 nudeotidea, althou^ longer and shorter lengths are 
viable. (To the extent that the mutational primer is also utiliaed to ineorporau 

46- 
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an ou^cf-frama start oodra or seeoadary structure. th« length 6t the mutational 
primer can correspondingly increase). Second, the ratio of mutational primer to 
target must be sufficiently excessive to "force" the mutation. Prefer^ly, the 
ratio of mutational pnmer to target is between about 250:1 to about 5000:1. more 
3 preferably between about 400:1 to about 2500:1. and most preferably between 
about 500:1 to about 1000:1. 

Because the paramatars of a PGR reaction are considered to be well within the 
level of sksll of those in the art, details regarding the particulars of that reaction 
10 are not set forth herein; the skilled artisan is readily credited with recognizing 
the manner in ^ch tbis type of mutation can be accomplished using PGR 
techniques * the foregoing is j^rovided as a means of providing elucidation as 
opposed to detailed edifieetion. 

15 As notedt it is most preferred that the fully impaired consensus Xosak is 

associated with the site of translation initiation of a dominant selectable marker 
incorporated into a tranacxiptsonal cassette which ferns a part of an expression 
vector. Preferred dominant seieetable markers include, but are not limited to: 
herpes simplex vims thymidine kinase; adenosine deaminase ; asparagine 

20 synthetase; Salmontlia his D gene; xanthine guanine phosphoribesyl transferase 
("X6PRT); bygromydn B phosphotransfimse; and neomycin 
phesphetransfbrase CNEO*). Most prefoaUy. the dominant selectable marker 
is NEO. 

25 The ^i*"^Mfit adectable maricer herpes simplex virus thymidine kinase is 
reported as having the following partiaUy impaired consensus Kosak: 

•3 +1 
cq C9t ATG GCC 

30.. 
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Set. Hdltr. S. laMrtfeoal AetWti« of a Piometeri(M» Tbyaiiin Ktoaw Gene. ' 
laoL A Cell. Bio. «/ft3ai8-330«. Figure 4. audtetide 764. By changing the *\ 
purina (G) to a pyrimidina (C «r TfU). a Ailly impaifad Koxak as defined herein is 
generated (tha -S'of tiia herpea naiplex virua thymidine kinase is a pjrriraidine). 
S caiangteg+lpu«toatoapyrimidiae«»"'««»'«"»«»«f«»'««^ 

amino add from alanina (CCD to proline (CCD or serine (TCD; it is preferred 
that consarvatiw amino add changes re«alt from the changes to the nucleotides. 
Hius. it is preferred that the change to TCTbe made because the change f«« 
alanine to serine la a more conservative amino add change than changing 

10 alanine to proline. 

Histidinol dehydrogenase is another dominant selectaUa marker. See. Hartmen. 
S.C. and MuUigan. R.C. "Two dominant aeling sdcetaUa markers for gene 
transfer studies in mammalian cells." PWA5 85:8047.8061 (1988). Tl« hi, D 
15 gen.ofSal»oneltatyphim«n»umhaathaia»»iagP«rtlallyimpairedconse^^^ 

Koxak 



20 



25 



-3 ♦! 
9C A9a ATG Tta 

A s . 3 is a purine, changing -3 to a pytiimdina (C or T/0) results in a fiiUy 
impaired conaansuaKoaak: ., i« .ppwdat.1. because these niidaotidas are 
upstTtam of tha start codon.no impact an amino add tnuttlation results from 

this change. 

Hygromydn B phosphotnmsfaiasa is am»ther damUunt aalactaWe marker the 
raportad saquanea for tha A|A gana (see. Grita. h. andDavias. J. "Ptosmid 
ancedad bygiamydaB rasUtaaee: the eequence of hygromydB B 
phosphotiwsfcrasa gaaa and ite aiprassiag in acteneWo eofi ^ 
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Sueehanmyces cenvisiM" Gent :179*I88. 1983) indicates that the consensus 
Xbtakis: 

-3 ♦! 
9a Gat ATC Aaa 

5 

0»ih .0 una M puHms; lH«a etianigliltf -fl Md i^jfKMUlfiM mutu l» u 
fully impaired consensus Kosak (this results in the foUowing encoded amino 
adds: ♦! toC -glutamine;*! toT-stopcodon. Because this codon is 
downstream of the start codon. the change to the stop codon TAA should not be 

10 accomplished). 

XGPRT is another dominant selecuUe marker. The reported partially impaired 
consensus Kosak of XGFRT has the foUowing sequence: 

15 -3 

tc Cac ATG Age 

See, Mulligan. KC. and Berg. P. "Factors governing the expression of a bacterial 
gene in mammalian celU.- Mot A Ceil Bto. l/5:44»^9 (1081), Figure 6, By 
20 changing the ♦! purine to a pyrimidine, a fully impaired consensus Kosak is 
created; the effect on the encoded amino Mid (AGC-seitoe) is as foBw^^ CGC 

arginine: TGC-cysteine. 



25 



30 



Adenosine deaminase (ADA) can also be utiliaed as a dominant seleetable 
marker. Hia reported eonsensttsKbsBk sequence fcr adenosine deaminase is: 



-3 ♦! 
ga Ace ATG Gcc 



See, Yeung, aV. €t oL. ^Identification of functional murine adenosine deaminase 
cDNA doaes by eomplementati^i in Edierichsa coU," J. Bio. Chem, 
260/ M:10299-10307 (1985), Figure 3. By chan^ng both -3 and ♦! purines to 
pyximidanes, fully impaired consens Kosak sequences result The encoded amino 

•19- 
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addcormpoiidiBgtoGCC(aianiiie)isehangidt either ^lia (CCC) or serine 
(TCC)» with the elienge to seiteebeinff preferred, due to the eoa^^ nature 
ofthbchaiige. 

5 The reported partially impaired coaseneue Kozak for aiparagine syatheuse la as 
follows: 

-3 

gc Acc ATG Tgt 

10 

Andnilia^ LL. et aL, "Isolatioa of hnman cDNAe for asparagine synthetase 
and expreseioa in Jensen rat sarcoma cefis." MoL Cttf. Bio. 7/7:2435-2443 
(1987). Changing the purine to a pyrimidine results in a fully impaired 
consensus Kozak. 

IS 

The partially impaired eonsnsus Kbssk for neomydn phoq^transfisrase (which 
includes an upstream out of frame start eodon) is as follows: 

-3 +1 

20 9gLJL£9 99^ ticg tct Cge atg Att 

Changing the ^1 purine to a pyrimidtne has the effect of creating a f^^^ 
impaired consensus Kozak (changes to the encoded emine add isoteudne, ATT 
areasfollows: CTT • leudne and TTT-pheoylalanine. with the change to leucine 
25 being preferred, dna to tf» conservative nature this change). 

The foregoing is not intended, nor is it to be construed as limiting; rather, in the 
context of the disdosed invention, the foregoing is presented in an effort to 
provide equivalent esamplea of changee in the reported consensus Kozak 
30 sequaess or partiatty impaired consensus Kozak sequences of several well* 
known dominant selectable aiaricers. 
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As noted, a most lyrefomd doaiaant adeetaUo marker le NSO. Panieuiarly 
preferred fUly ia^atxed eonseneue aeqaeaeea for NEO are as foDows: 

Txx ATG etc <SEO ID NO: 3) 

Cxx ATG CCt <5£0 XD NO: 4) 

Txx ATG Ttt iSCO ID NO: 5) 

Cxx ATG TtC <S£Q XD NO: 6) 

where x are nudeotidee. SEQ. ID. NO. 3 is most prefer r ed ; and zx are preferably 
CC. 

Other transcriptioxud eassettes. which may or may not indnde a fblly unpaired 

consensus Kotak, can be incorporated into a vector which indudes 

transcriptional cassettee containing the disdosed and daimed foSy impaired 

conseneue Kbsak; audi *other transciiptionai casesttes" typically are utilized to 

allow for "enhancement," "amplification" or "regulation" of gene product 

repression. For ezsjnpla, eo-transfeetion of the eaogenons DNA with the 

dehydrofolate reductase OSBFR) gene is eawmplary. By increasing the levels the 

antifolate drug methotrexate (MTX), a competitive inUbitDr of DHFR. presented 

to such cells, an increase in DHFR prodoctf on can occur via amidification of the 

DHFR gene. Benefidally, sztensive amounts of flanking exogenous DNA will 

also become amplified; therefore, e xog en o us DNA inserted co-linear with an 

expressible DHFR gene will also become overexpressed. Additionally, 

transdptional cassettes which allow for regulation of expression are available. 

For example, temperature sensitive COS cells, dehved by pladng SV40ts mutant 

large T antigen gene under the direction of Rous sarcoma virus LTR (Insensitive 

to feedback repression by T antigen), has been described. See, 227 Science 23*28 

( 1985). These cells support replication &om.SV40 ori at 33^ but not at 40''C 

and allow regulation of the copy number of transfected SV40 oh-containing 

vectors. The foregoing is not intended, nor is it to be construed, as limiting; 

•2X- 
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rather the foregoing is intended to b ezempUry of the types of caseettes which 
can be incorporated into expression veetore comphsing th died eed folly 
impaired consens\2S Kosak. The skilled artisan is credited with the ability to 

S ebjeetive of the expreeaion eyetam. which are applieaUa and which can be 
advantageously exploited. 

As indicated abova. in particulariy preferred embodimenU of the invention, at 
least one out^f-ftama start codon (if ATO) is located upstream of the fUly 
10 impaired consenaus Kosak ataxt codan. without a atop codon being located 
between the out-of*fkame start codon and the fully impaired consensus Kosak 
Stan codon. Tha intant of the out-of^firame start codon ia to. in efiectt further 
itnpyir translation of the dominant sdectaUe marker. 

15 As used herein, tha tmn 'stop codon" is meant to indicate a "nonsense codon. " ie 
a codon which does not encode one of the 20 naturally occurring amino adds such 
that translation of the encoded matenal terminates at the r^on of the stop 
codon. This definition includes, in particular, tha traditional stop codons TAA. 
TAGandTGA. 

20 

As used herein* tha term "out<f*frame" is relative to the fially impaired 
consensus Kosak start codon. As those in the art appreciate, in any DNA 
macromolecttla (or RNA maeromolecule) for every in-frame sequence, there are 
two out^f-ftama sacpiattoes. Thus, for example, with respect to the following 
25 sequence incorporating a fuUy impaired consensus Kosak: 

•3 ♦! 

Tfi e gaT C ye CXX ATG CXX 

30 the in.frame codons ara separated by triplets, eg. gcA, TGc. cAT and Ggc; the out 
of.frame codons would include, eg cAT. ATG, Gcc ccA. ATG and TGg. Thus, two 

.22^ 



wo 94/t 1523 PCT/U593/1 1221 

start codons (in capital lettafs and undariined) an out-of-fraaM ralativ to the 
start codon of the fuBy impaired ceaansua Kmk* 



Whmn on mt-atVlramo mimH oadaa it uiiliaad. it ii orafarv^d that this bo 
5 within aboat 1000 nudeotidei upatream of Che ftOly impaired qpaaeneue Kosak 
start codon, more preferably within about 350 nndeotidee upstream of the fully 
impaired consensus Koxak start codon, and most preftrably within about 50 
nucleotides of the fuBy ia^aired eonsenaua Kbaak start codon. | 

10 As is appredaled, the upstream sequence can be manipulated to achieve 
positioning at least one oatmf-fraaie sequence upstream of the fiilly impaired 
consensus Kbzak start eodon using (most preferably) a mutational primer used in 
the type of ampUfication protocol described above. 

15 Utilization of a ftiDy impaired conaensus Kosak start codon located within a 
secondary structure iU a "stem-loop" or "hairpin"} is beneficially viable to 
impairment of translation of the protein exieoded by the dominant selectable 
marker. In such an embodiment it is preferred that this start eodon be located 
within the stem of a stem loop secondary structure. 'Diese, by way of schematic 

20 example, in such an embodiment, the atart codon of the fully impaired consensus 
Kozak is positioned as foDows: 
T 

X X 

25 XX 
X A 
X T 
X G 
xA TGxx Cxx 

30 

(An out-of-frame start codon which is not part of the secondary structure is also 
represented*) As is appredaUd. within the stem loop, complementarity along 
the stem is, by definition. Really required. For exemplary methodologies 
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regarding, int^r alia, imroduction of sudi secondaiy stntctum into the 
sequen€e. as well aa infemation regarding secondary stnieture stability, see. 
Kosak. PNAS. 1986, supra, 

• 5 Ad noted, it is preferred that a dominant eelectable market with a naturally 
occurring intronic insertion region or an artificially created intronic insertion 
region be utilized and at least one gene product of interest inserted within this 
region. While not wishing to be bond by any particular theory, ihe inventor 
postulates that such an arrangement increases expression effioenor because the 

10 number of viable colonies that sumve the sdectxra process via-a-vis the 
dominant selectable marker will decrease; the colonies that do survive the 
selection process will, by definition, have expressed the pvotaia necessary for 
survival, and in conjunction therewith, the gene product of interest will have a 
greater tendency to be expressad. As ftirther postulated* the RNA being 

15 transcribed from the gane product of interest within the intronic insertion region 
interferes with completion of transexipCioa (elongalioa of RNA) of the dominant , 
selectable marker, therefore, die position that the dominant selectable marker is 
integrated within the caDular DNA is likely to baa position where a larger 
amount of RNA is iidtiany transeribed. 

20 

Aa is appreciatad. prokaryotie proteins do not typically indude ^fices and 
introns. However, the m^tyofdemiaantselaetablama^rawhidi are 
preferred for expression vector tedmdogy are derived from prokaryotie systems * 
Tlius, when prokaryotio^mved dominant selectable markers are utaised. as is 
25 preferred, it is oOea necessary to generate an aitifidal splice witUn the gene M 
to create a location for insertion of an intron comprising the gene product of 
interest It is noted that whfle the ftUowing rales are provided for selection of a 
spUce site in prokaryotie genes, th^ can be readily applied to eukaryotic genes. 

•24* 
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A genersl mtfhanim ftr th» aplidag of mtSMnfer RNA pncnn n in eukaryotic 
cells 18 delineated and aummarixed in Sharp, PhiUp A. "Splidn^ of Messenger 
RNA Preeursora" Soenee. 235: 736-771 (1987) (see. in particular. Figure 1) 
which ie insatMraiad hmisi by ra&renea. Based uMn Sharp, tbafa are faur 
5 Tninimw criteria in the nudeie add sequence which are neeessary for a spUee: 

(a) 5' splice donor; (b) 3' splice acceptor, (c) brandi point, and (d) poiypyrimidine 
tract. The consensus sequences for the 5' splice donor is reported to be 
C A 

AAG/GTGAGT and for the 3* spUce acceptor. NCAGM} (where a T symbol 
todicates the splice site); see, Mount, SJtL *A Catalogue of Splice Junction 

10 Sequences'* //tie. Acids. Res. 10/2:459472 (1992) which is incorporated herein by 
reference. The consensus eequenee for the branch point, ie.tha loeatian of the 
lariat formation with the 5' splice donor, ia reported as FyNPyPAPy; and the 
reported preferred branch site for mammalian RNA splicing ia TACTAAC 
(Zhuang. Y. at al. UACUAAC ia the preferred brandi rite for mammalian mRNA 

15 splicing* PNAS 86: 2752-2756 (1989), incorporated herein by reference)* 

lVpiealIy» the brandi point ia located at least approiimat^ 70 to about 80 base- 
pairs from the 5*-^Bee donor (there is no defined upper limit to thie distance). 
The poly pyximidine tract ^rpically ie from ebout 15 to about 30 base pairs and ts 
most typically bounded by the branch point and the 3' splice acceptor. 

20 

The forgoing is descriptive of the criteria imposed by nature on naturally 
oceuRing splicing madmnisme. Because there is no exact upper limit on the 
number of baee paira between the 5* splice donor and the brandi point, it is 
preferred that the gene ^oduct of interest be inserted within tUs region in 
2S situatiena where a nntural intron exists within the dominant sdectable marker 
Ho weiver, as noted, auch introns do not »st within most of the preferred 
dominant selectaUe markers; aa such* utilisation of artificial introns are 
preferably utilised with these maricers. 
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la order to generate ea ortl&da] iatron. e "s^ee donorsplice acceptor" site must 
be located witfaia the encoding region of the dominant sdectaUe marker. Based 
uoea Shera and Mount* it is most preferred that the ftUewihff eeouence function 
S as the splice donor»plioe acceptor site CAGG (with the artlfldal spUce 
occurring at the GG region). A preferred sequence is AAG6. 

Focusing in on the most preferred sequence CAGG, the foUowing codons and 
amino adds can he located within the encoding region of the dominant selectable 
10 marker for generation of the artificial intronoc insertion region: 

A B C 

CAP/ sm ziCA asa 

Gin Ala Ala 61f Ala Leu Arg 

Asp Pro 

Qir Ser A« ^rt 

Ghi Thr ^ 
Val cy» ^ 

Qly Tyt 
Kie Val 

ne 



(Aa wiU be appreciated, the aanie approach to determining viaUe amino acid 
residues can be utilised for the preferred sequence of AAGG). The most 
15 preferred eodon group fbr derivatien of the spUee donorspliee acceptor site is 
groupA. Once these amino acid eequences are located* a viable point for 
generatim of an artificial intronic insertion region can be defined. 

Focusing on die prcfenred NEO dominant selectable markerp amino add 
20 residues Gin Asp (eodon group A) are located at the positions 51 and 52 of NEO 
and amino add residues Ala Arg (eodon group C) are located at positions 172 and 
173 (as is appiedated, multiple artificial intnmic insertion regions may be 
utilised). Focusing on residues SO • S3 of NEO. the nudeic add and amino add 
sequences are as foDows: 
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50 51 52 S3 

5' CTG CAG OAC GAG 3' 
Utt Gin Asp Gltt 

Accordingly, an artiScial intronie insertion region can bo generatad between 



point, a polypnmidine tract and. preferably, a region fl>r inaartion of a gena 
product of interest, te a region amendable to ensymatie digestion. 

Two chtezia are import for the artificial intrememseition region: thefirsttwo 
10 nucleic add residues of the 5' splice site ( eg abutting CAG) are most preferably 
GT and the first two nudete add reaidttea oftha 3' aplica site (ct abutting G) are 
most prefierably AG. 

Using the criteria defined abouva. an ardfidal introme insertion region waa 
15 between amino add residues 51 and 52 of NEO: 



20 

(Details regarding the methology for creating tfaia artiftdal intronie insertion 
region are set forth in the Enmple Section to follow). The Not I dte was created 
as the region whera the gene product of interest can bo incorporated. Therefore, 
upon incorporation, the gene product of interest is located between amino add 
25 residues 51 and 52 of MEO» such that during NEO transodaaion, the gone 
product of interest trill be "spUeed-out*. 

The host call Una is most prefierab^ of manwnalian origin; those ekilled in the art 
are credited with aUUty to prefierentially determine particular host ceQ lines 
30 which are bast suited for the dedred gene product to be expressed therein. 
Exemplary host ceO linea indude, but ani mt limited to. 0G44 and DXBU 



0 r«»<rfuM at ttfMl aa orKSO. this P«el»e mvM e«wf^«ak|jr —mpHmmm m hmmmmH 



SO HBO 51 
USU QLN 
CPS CAG/! 
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cardnoBa). CVI (a«nk«y kidney lint). COS te d«riv»tlv« of CVl with SV40 T 
»ntit«B>> 91610 (Chinese haaster fibroblast) BALBC/3T3 (moose fibroblast). 
KAK (hamster Udaey UaeX SP2/0 (mouae aqrdema), P3i8S-A83.6S3 (mouse 
myeloma). BPA-ldBPT (bOTtoe endothelial eeUs). RAJI (human lymphocyte) and 
5 293 (human iddney ). Host eeU linet ere typicaUy ovailable tnm commercial 
sendees, the American Tiuue Culture Collection or from published literature. 

Preferably the host ceD line is either DG44 or SP2«). See. Uriand. G. et al.. 

"Effect of gamma taya and the dihydrolblate reductase locus: deletions and 
10 inversions." Som. CeU & MoL Gen. 12/ft555-566 (1986) and Shulman. M. et al.. 

"A-better cell line for making hybridomas secreting specific antibodies.' Nature 

276:269 (19T8). respectively. Most preferably, the host eeU line is DG44. 

Tranafection of the plasmid into the host cell can be accomplished by any 

techniime available to those in the art. These indude, but are not liaiud to, 
15 transfection(induding electrophoresis and electropOT«tion),eallfta8ion with 

enveloped DNA, microinjection, and iaftetion with intact viwa. See, Ridgway. 

A.A.G. -Mamma]ianE«p»«Mi«nV«to«.* Chapter 24A ». *704'2 Vector*. 

Rodriguez and Denhazdt. Eds. (Bttttarworths, Boatoa. MA 2988). Most 

preferably, plasmid iatcedoelion into tha host is via daetreporation. 



20 



The fbQowiag examplaa are not lataaded. nor at* thoy to ba eonatouad. as 
Umiting the invantlan: the asam^es are iatanded to damonstntatba 
25 applicability of an eabodlmant eftha iBMatiea dttdofura herein. Tht disclosed 
fully impaired eenaaasua Kosak aequanm is inteoded to ba broadly applied as 
delineated abova. Howwrer, for preseatatieaale«dan«y, exemplary uses of 
panioilariy prrfarrad ambodimaBtt of fclly impaitwi eoosaasw Xozafc sequences 
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^ utilixMl In eofljunrtMi with tandem ehimerie uitfbody wpreasion vectors 
(alio tefemd to hewia M aatflwdy exiwawlw veetan) M diedwed beU^ 

I. TANDSMCHWEWCANimODTiai^ESSIONnCCAEnVECTOR 

* B cell lymiAocytes arise fh,«pluripot.nt stem cdl. and proceed through 
ontogeny to fiiDy matured antibwly secreting pl«una ceBs. Tbe htamn B 
lymphocyte^strieted differentiaticn antigen Bp35. referred to in th. art as 
• CD20.- is a cell ««fae. non-glycosylated pbosphoprotain of 35.000 Oaltons: 

10 CD20i..xpres.edd«ri«gearlypre.Bcelldevelop«entju«tpri«rtothe 

expression of CTtoplwrniCi heavy chains. CD20 is expressed «B«.t«»Uy until 
theplasmaceBdiffer.ntiation.tage. The CDZO molecule regulat... step in the 
activation proces. which i. re«p«red for cell cycle initiation and differentiation. 
Because CD20 is expressed on neoplastic B ceBs. CD» prwride. a pro«i«ng 

15 targetfortheraTVofBceDly»phoaas«ndl«ita»i...Th«CMO«.tigenis 

especially suitable as a target for «iti^ «itibody mediated therapy because 
of accessibiUty and sensitivity of hematopoirtic tamer, f lyn. via immune 
effectormecbanisms. Anti-CD20 antibody medirt«l therapy. inter o/ic is 

di8closedinco^«ndingSerialNo..07/978.»landSerialNo. 

20 simultaneou.lyberewith.'niea«tibodiesutili..d«.«o«..«i««anchimeric 

anti^20 antibodies exp,«u»d at high tevd. in ««««aliaa cdls (chimeric Mid- 
CD20-). Thisantibodyw«derlv«dwtogvecter.di«ao..dher«h.towlt:TCAE 
5 2 ANEXl;ANEX2: GKNBOSPLASPjandNEOSPLMPC-taddltiond 
vector. TCAE 8. w«i d«. utili««l t« derive chim«ie «iti^20 «ttibody TCAE 
25 8iaidentic.lt«TCAB8.2a«VtthattheNB0t«ndati»al«.n.itei.a 
partianyimpairedconsm-usKo-k. TCAE 8i.d..c.ibedi.tb.oo^ng 

patent document filed herewith.). 
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In co«aMay-««gn«l United Stat- Serial Ntt»b« 07/912.292. disdwed. i«er 
ofia. are httman«)W World monkey ehimerie aatib dier. an embodiment of the 
invention disclosed therein ar hninan/macaquedumeritant4-CD4 antibodies in 
vector TCAB 6 (««. Figure 6 of Serial Number 077912.292. and eomspoading 
discussion). TCAE6i.substantiallyidenticaItoTCAE8.2;TCAEflco«tain. 
human lambda constant region, while TCAE 5.2 omtains human kappa constant 
region. TCAE 5.2 and ANEX 1 (refetwd to in that patent docuanit as TCAE 12) 
are disdosed as vector, whidi c«i be utiH.«i in eoajunction with hnmanrtJld 
Worid monkey dUmaricantibodiea. The compa«ti« data aat forth in Serial No. 
07/912.292 vis.a-via TCAE 5.2 and ANEX 1 is rdative to eapresaion of drimeric 
anti-CO20 antibody. 



TCAE 55 was darwed fhan the vector CU)N. a detivatiye of the «ctor 
RLDN10b(««.2S3&i*«« 77.91. 1991). RLDNlOb is a derivative of the vector 

15 TND(s«;7DJVrA661-661.l988)./ethevectorTb«ilyline-iaa.ttB6WB:TDN 
- RLDNlOb - CLDN - TCAE 5.2 - ANEX 1 (the use of the symbol is not 
intended, nor is it to be construed, as an indieatioB of tha dfiwt necessary to 
adueve the dumgea fiom one vector to the next; t*. to the contrary, the number 
and complemty of the step. n««.«T to geaenite TCAE 5^ fh,« CLDN were 

20 extensive). 

TND was designed fbr hi^ level expressions of human tissue plasminogen 
activator. RLDNlObdiflbraftwnTNDInlhefiDowingwayB: the dihydrofoUte 
reductase CDHER") tranaeriptiBnal cassette (camprislng promoter, murine 
DHFR d)NA. and pdyadenylation region) waa placed in between the tissue 
plasminogen aedvator cassette rt.PA expre.«on cssette") and the aeomydn 
phosphetranaferase (-NEO* cassetu") that all a«. «.ettes were in tandem 
end in the same t«««oiirtional orientation. Tte TOD vector permitted selection 
with C418 for eeUs carrying the DHFR. NEO and t-PA genes prior to selection 

■30- 



25 



10 



IS 



20 



25 



...... pcr/tJS93/imi 

for DHFR gme amplificatioa in wponse to mtttaoaaate. MTX. Tb» promoter 
in fhmt of th. DHPR gw* wm ehaiig«d to tho mouse beta gleUn ouiior promoter 
(«..3MotC.flaa 1846.1284.1983). PiaaBy. the t-PA cDNA WM replaced by a 
poWIinker toch that diSi»«nt genes of interest can be inserted in the polyUnker. 
AU th«.B eukaryolic tr«««iption.l eas-tte . (t-PA. DHFR. NEO) of the TNO 
vec«,re«.be.*p.r«edfh»th.bacteri.lpl.«aidDNA(pUC19de,ivativ^^ 

digestion with the mtriction endomideaee Not L 

CLDN differs from RLDNlOb in the foHowing ways: Tb^ Rou. LTO. positioned 
in front of the polyiinker. was replaced by the haaan qrtomeealovinu immediate 
early g«.e promoter enhancer CCMV-). (see. 41 Cefl 521, 1986). ft«a the Spe I 
site at ^81 to the Sst I site at -16 (these number, are frem d» Cea nrflsrence). 

A. the name indicates. TCAE vectors wet. desiin«l to high tovd expr^irions 
chimeric antibody. TCAE 5.2 diflbr. from CLDN in thaiatewingw.3^: 

A. TCAE 5.2 comprises four (4) transcriptional ea8sattaa,ai opposed to three 
(3). and these are in tandem aider, ie .human immunoglobulin H^t chain 
8bsentavariableregion;ahumanimm«no^dbuHn heavy chain abeanta 
variable regio«;DHPR5andNEO. Ead» tr««rip«iowa contains its own 

eukaryotic promoter and polyadenyUtin lagiOB (rtfewnta i. made to Figure 2 
whichisadiagnBnmaticr«pr«e«t.tlon«fthaTCAB5.2v«rtor). TheCMV 
promoteWenhancar in frtmt of th. immimogbbulin heavy chain is a t^ 
veraion of thapwmotertotoar in front of the Ught chain, from the Nh^ 
at .350 to the Sst I sita at .16 (the muDbar. «• fitmi the CWt f«te««. 

SpedficaOy. 

1) Ahumaaimmunoglobulinlightchaineonsttntragionwasdeiived 
viaamplificationofcDNAhyaPCRreaetion. InTCAB5Athi.w« the human 
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iM»uwloboli»Bghteh.i»l«W.eo«««tT«i«»(ICi*««»^ ^ 

«id. 108-214. .lUWK«3).-«*th«>«««»««»«««^»*^ 
^ 1 ean««««gi«»(Kd«t«««b«i»g«i«»-dd. U4-478. aUo^ype Gmla. 
0«dt). Th.liglitehaiawMteolrt«lfro«nonnalhu»anWod(IDEC 
pha«««»ti«U U Wl.. CAh RKA ti«»fr«n wa. ««d to 

««th«l*. cDNA which wMth«««plified usingPCRtechniquea (primers wer. 
d.riv.dvi.*vi.ih.cao.«»«.Kabat). heavy chain w« i-<^ted (us.r^ 
PCBt.eh«i<,»..)frc«d)NApr.P^fto«RNA Which wa. in t««d«^^^ 
criUtr«»rf«««i wlthah-unlgGl vector(«e.3 Fn^. 531. 1990; vact« 
pNn62>. Two«»«.add-we«ch«uM«th.i.ol.««dha»«nI«Glto«*tch 

. . vahat. ts wit 801110 add 225 wa« 

thaconsMBitt amino acidw<|u«i»uiKai»t.»wK. 

, . in^t^ncjn and anuBO add 287 was changed 
dangad fiom valine to alanme (GTT to GCA). ano aiiww 

from matWaniae to lyiina (ATC to AAO); 

2) ThehumanimmunoglotaKnHghtaadhawychaineaaaattaa 
cont-n synthetic «gnal »«i«.«ce. for uc^lon of th. i-mu»^^ 

3) Tliehumanimmunoglobulinlightandheweh«tocaa.ettaa 

conuun .pedftc DNA «.tnctio« .ii.. which aUowferl««tion oflight-ui haavy 

20 i««unoglobulinv.hablereg5on.which«««t.i»that«i^^^ 
anddonotdtarth.Mni«oadd.nonnaD,fc«ndiai«nu«i«ld«^ 

4) TheDHFRca«.etttcont«inediUow«a«k«yrtie|»tm»t«rO^^ 
betaglobinn»iorpro«ot.r.-BETA-i«dpdjrtd«i]totionr.glODfto^^^ 

25 hormone polyadenylation,"BGir): and 

5) The NEO eaasetta ce«t«n«i iu own a-tanrotie p>«BOt«r (BETA) 
andpolyad«»yiatio««gion(SV40e«iypoIyida«^»«*«^'^^ 
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With raspwt to the TCAE 5^ aad thfl NEO easMtte. the Kotak nsion was a 
Gonseanis Kenk (whidt included an upstnam Cla I rita SEQ ID NO: TY. 

Clal -3 +1 
TTSGGAGCTTGG- AICGAT CcAcc ATC GtC 

ANEX 1 (previously named TCAE 12 in the refereoced ease) i« identical to TCaE 
5.2 except that in the NEO cassette, the Koxak region was folly impaired (SEQ 
IDNO:S): 

Clai -3 *i 

TTCCGACCTTCG ATCSAT CCTcc AtS Ctt 

As disclosed in the eonaonly-assigned refeteneed ease, the impact of ulili«tion 
of the fully impaired consensus Koiak was stiilmir. w»»ti»e to TCAE 5.2. there 
was a significant (S-Wd) twiuetion in the number of ANEX 1 0418 resistant 
colonies (258 ftum two eleetroporationa wreus 98 from sis electroporational from 
the same amount of plasmid DNA tMnsfcetad per ceU; and, there was a 
significant increase in the amount of co-linked gene product eipressed in each of 
the ANEX Idones. ItefciencingtheUatogramofPigttre3(Pigttrel6ofthe 
commonly assigned lafereneed can\ 258 colonies were derived from 2 
electropontions of 25 uf of DNA esntaining a neomycin phosphotransferase gene 
with a consensus Kosak at the translation start sitt. Two-hundred and one (201) 
of theee eoloniea did not express any detectable gene product Oess then 25 ng/m» 
of chimeric immuno^obulin). and only 8 edtonies expressed more than 100 ng/tal. 
Again, referencing Figure 3. 98 coloniM were derived from 6 electroporations for 
ANEX 1 of 25 M of DNA containing a neomycin pbosphotrensferase gene with 
the fiiBy impaired consensus Kozak at the translalion start site (6 
eleetropontiona were utilited in order to generate stotistically comparative 
value8;thi8i»a8b.caiiseonavenige,eachelectropo«tionforANEXl yielded 

about 16 colonies, as opposed to about 129 colonies per electroporation for TCAE 
5.2). Eight{8)ofthe ANEX 1 colonies did not eipress any tooetaWe gene 
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praduet than 25 ngtel). whUe 62 of Uwm «lo« s wm «pr«»lng greater 
than 100 agtal: of these 62 eolom«.. nearly 23 weiaafpiwainR over 250 affm\ 
{23»). with 6 asptcssing greater than 1000 ngtai 16%). 

The!breg«inge^deace..i««.r«/Jo.thefe«o«ng: 1) beeauae the difference 
b.twe«i TCAE 6.2 and ANEX I waa Uiaited to the Ko«ak translation start site of 
the NEO gana. and baeauae the gene product of interest (chimeric anti^D20 
antibody) waa co-linked to the NEO gene, a emdusion to be drawn is that these 
differences in results are attributed solely to the differeneea in the Kozak 
txansUtion sl«t sita; 2) it was eiperimentolly confirmed that utilization of a 
fully impaired consensus Ko«k in coiyuaction with a dominant .el«*iW. 
number resulted in significanUy less viable colonies: 3) it was exp«im«ntally 
confirmed that utilization of a fiiUy Impaired consensus Kotak in conjunction 
with a dominant selectable marker co^inked to a desirwl gana prodtMt 
signifieanay increased the amount of exprwsed gen. product. ThM. tha number 
of colonies to be acreaned deereas«i while the amamit of e^saad tn» product 
increased. 

IL IMPACT OP OUT-OF-PRABIB STABT SBQUBNCB 



Conceptually, further impairment of tamslation initiatioil of tha dominant 
s electobla markar of ANEX 1 could ba aSbetoatad by ntiliaatian af at least one 
out^-frame ATO start eodon upstream of the naoniydnphoaphotrmiaferase 
startcodon. Taking this approaA one atop farther, tttiliaattonof a secondary 
25 ,trTictuierhairpia"^wWchineorporatsdtha«»Bt.«teodoninthiat^^ 

thereof. woiddbapre«imedtofbrtheriahibUt,M*««^ Thus, when 
the outM,f.fhfflie start eadoa«tany Impaired consensttsKoadt waa conaide^^ 

region was d.«gn«d a»^ that the posaiWUty of aud. aecondary rtruetuies was 

increased. 
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As indicated pf«vioudy, the Koiak region fer the neo start codon in the ANEX 1 
veecoris: 

TTCCCACCTTCC ATCCAT CC Tcc ATG Ctfc 

5 

The desired sequence for a vector identical to ANEX 1 but incotpomdnc the 
above-identified changes vis-a-vis the neo start codon. referred to as ANEX 2. is 
as follows (SEQ ID NO: 9): 

10 CCA GCLJC5G AGO A ATCGAt CC Tcc ATG Ctt 

Cnieout.of.fiame start codon is underiinedJ T1iefiillyiiiipair«d consensus 
Kozakof.ANEX2isidantiealtothatofANEXl. Tlie pmcipal diflhrence is the 
inclusion ofthe upstream ouvof-frame start codon. A possiblo dSf&rance is the 
15 fonnation of a secondary structure involving this sequonea. imposed as fiHows: 

CG } 
T A } CLA Z Sice 
A T ) 
20 AT 
GC 
GC 

Ar 

GC 

25 GC 
TA 
AX 
Cfi 
GC 

30 AT 
CT 
CG 

The sequenco in bold, ATG. is the upstream out^rf-frama start codon; the noop- 
35 portion of the secondary structure U the OA I site; and the 8«|^ 

the "T and "C" (italics and bold) is the start codon (underlined) of the fiiDy 
impaired consensua Kosak. 
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^lfre«XhoI(5520,toCl.I(5901);«..R,««4. IW« — « follows: 
a 3-.PtlB«r 489 (SBQ ID MO: 10): 

?S Sc TAT GTC »« *eC AAA TGT 

The uppwUned portion of Pninor 489 « a Cia i bh. 
the fdly i«paiT«i Ko«k tnu»lati«n .t«t 

15 5-prliner488<SEQ.lD.NO.U): 

5. -etc 660 2fff SBJ Ctt T6C-3' 
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The upper-toed portion of Primer 488 i. an Xho I site. 
^e«p:i-«.we«p«pa«du-in.«.AB13SlPCRMATB«DNA«rnth.dx.r 

<AppUedBio.y««n..Po.tera*y.CA). Pho.pho«-idit« 
C««*ein(Gla.gaw.Scotl«uD: dAthx) • Prod. No. 20^120-21; dO«b«) • P«d. 
No. 20^110-21; dai«).Prod. No. 20^imi;T.Prod No. 20-8100.^ 

r«croUt««-)ofanti^iaTCAB8.2i«pl«n»dgrowain&cofi.«^ 
QM48(obtai»odlh«th.ATCaw..«tai«d*lthmofde»o«i»dw.^^^^ 

Prinier488(64p««l«);««i*Xofpn«er489(S6p-bl..). Thi. «. filW by 

.den.tu«tioa«.p(94-C.5»n.)«^.r««ta«tio.«.p^'^ 

The«.fUr.4Xof8»«.dim^(F"-i^.*^^"^"*^'^'^^ 
dCrrP.PnKLNo.U1221;dGTP.Prod.No.Umi;<riTp.Prod.No.U123UUof 

PfuDKApoly«.r«e(St«t..en..UJdl.CAProiN. 800135. 2.SU/J.1),J^ 

SOXoTHdneraloUoverl., was added thereto. ftllow«l»V 30 qM«i.irtth.«* 
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cydec the fdtewing: t2n:.2«tfn.94-C.X-in.;54-C. X-u. Ten 

^t«. (lOX) rfthi. «!-ixtu« w« ««ljr»dby «•» .l«:«xopha«,« 

,«srft8 «t rt-wn): a ri«^e band *M toundat «J««t 400 b-e pair.. 

O. l«dXho l.»foaow.:20Xof»ti^D20inANBCl w««ln«edw.th lOXof 
10XNEB4 buffer (New EngUnd Bidabs. Be,^. MA; b.«io.fUr. NEB): SK O. 
1 (NEB. Prod. No. 197 S. 60«V. and 64X ddoniaed wUr. IW. .toixto* 
.0 incub.udov«u^t.t37n:.fonow«ibytb.«»ditl«rf8XXh.l(NBB.Prod 

No.l46&100tt)andi»eubatien.t37.Cfcr2h». Tha re«ldnf 

deaii:n.t«ih«««a.-Cla VXholcutANEXl^ th..«»«^ 
PCR«^twa.p«p.«danddi^...dwlthCUl«.da.l..fbll^^^ 

ch, PCRfragmant wa. .d«i«d with lOXaf SMNaOAc; lX10*«,di«in doda^ 
15 «lf.taCSDS>.«»d90X|W«HCla««wtT1d.«i-^ 

30 sec followed by a 1 .S-in (1700 BPM). 11- '^'^ 
toaspin «olu»» which r.«dt«iin8SXtot.l.dndxt««.Tbthia.d-«rt^ 

add«110X10XNEB4.Ubavin...n«an«»«n(BSA.100X;NEB). 3.0.1 
(24u).and2XXbol(40o). Thi..dn»«««w«ii-c«ba*«l.tarCfar2h«. The 
,0 „sul.in.«atanalUde«gnat«lb.raia..-a.lflttale«tK:R488«8^^ Both 
aal>Xh.lcuiANEXl«d a.l/Xholc«tPCB488/489w«.a».lyudby 

a6.«««gelelectnn*««U«»dth.««l«»f»-«Uw«.eb.^ 
relative loeatiai on the gd (mtfto »»t 
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DNAIif«Ut.Pnrf.N..UlBI»»Jtt««>»'l'™««*"«~'°"- 
„^«„^«..d-.«d-U..«aMK.O*cU ,»SDS.fflXTE.»a«X 

p„t No. MS79S0B) ">Pl«Ui>> *^ 

S«t«n(Pr«lN..Pil-A7lOO).fc«»*«--»a«'-«'»«™^ 

. ^ ♦w^ Ahttnt 2 vftctor depending oa the sufficiency of 
20 plasmids may have comprised the ANEX 2 vector, oepen«»uB 

the foregoing. 

*^ P«™«»p*"" - n,^. 9V of fiach isolate was 

,HZB.P--.N..t«.10-X.:«-«ftH«.Tbi..i-««'«»^«'' 
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(results not shown); niiw (» of ths bands wn sabstantially Ideirtieal to the 
ANEXl standard. ni(i) ghowsd a sB^t dilBweBce in band pattern. Fortius 
single isolate, tha first two bands weii at Iffll and 670 tt: for the ANEX I Hinf 
I dlgistadpr»d«rt. the first three bands were at 1691. 766. and 670 kB. The 
S missliig band at 786 kB for the single isolate was attributed to the presence of 

the Hinf I site therein, indicating that the desired change to ANEX 1 was 
incorporated into this vector. This vector was designated "Anti-CD20 in ANEX 2 
(Gl JO." and is generally referred toV the inventor as ANEX 2. 

10 Electroporation of anti-CD20 in ANEX 2 was accomplished as foUows: two- 
hundred and forty nicroliters (24«t) of the anti^20 in ANEX 2 DNA (400Mg) 
w» adaixed with lOOX of 10 X NEB2 buffer ; lOOX. of Sttt I (NEB. Prod. No. 
187S. 1000 tt): and 560XTE. and incubated at S7^fer2h»8. Hii. adnrfrtore was 
then placed over 8 spin columns (125X eacW. Mowed hjr addition of llOX lOX 

15 Not I buffer (NEBk lOX lOOXBSA; and 20Xof Not I (NEB, Pwd. No. 1898. 

800u). This admixture wasincubatedBt37HJfiw3hfa..fiflow.dhytheaddition 
ofl20Xof3MNaOACandl2X«flO*8DS. The admittaw wa. tnwrfanid to 2 
vortex tubes and 500X of phenyI«HCWisoamyl was added to each, fidlowed by a 
30 sec vortex and I min. spin (1700 RPMl nMfaqprnoua piiase was removed 

20 fromthetubesandsegr«gatedinto3t«bes.fcltowedVyth.addltiimtoeachtube 
of.20»C100%BTOH.foIlowedbyl0min.spin(13.0001BPM). Thereafter. .20«C 
70*ETOHw.saddedU>eaehtrf)e.fi»1to«db,lmin.si«n(13.000BPlO. The 
tubes were then pla»dinaSpeedVAC«f»rdr3mg.fbnow«dlv««»3P«^ 
the contents in lOOXTB in a Bt«ila hood. flv«mie«»Bter»(5X)«ftha 

25 resu,pendedDNAwM«taixadwith99ttrfdd«»i«di«t.'(lja»diJ^^ 
Aft optical density reading wa. taken (OD.260) and tha amMnt of BNA present 
was calculated to be 0.75Mgflt fa order to utilise 28 M of DNA for 
e»ectroponition,3aiofth. 1300 dilution of the DNA was utiliaed(25Mwas 
utiBsed as this wa. tha mnount of DNA tttiliMd for TCAB 5.2 and ANEX I in the 
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fo«goiBgE«aid6 U Tte 1:200 dattttoDrfDNAw«fenB.Uy«lte« 
-Sttt I. Not I cut anti.CD20 ia ANBC 2 (25m«) in TE" and ceiwraUy wfemd to as 
-aBti-CD20inANEX2.' 

Host edla «till«d w« D044 CKO IXHO") U«. Urloub. O. flom-ite CH. IMS 
,„p«). On. hundred -mffitw of 6.6 x 10$ cU./«l (84%) w.« .ubiectod to a 2 
«a. vin at 1000 RPM. These were wariied with 60 «1 «cn,8. buffered 
solution, followed by 6 ain. spin at 1000 »PM; the material wa. then 
r„„,pendedin4J5naofth.8uen,.ebufferedeoltttion. Thereafter. ceUs wore 
counted and 0.4»lofCHOcell.(4.0xl06 cdl.) we«.d«i«dwith SOXofth. 
antWJDZO in ANEX 2 in BTX .terile. diaposabl. electwporation euvottas. 
El«etroporationsetting9werea8foUow»:2lOvolto; 400 nderoftraday 
cp.eit««e; 13 ohnu re««tanee, using a BTX 600~ d«*« «dl namipulator 
(FIX. San Diego. CA). Nin.(9)eleet»porationsw.«eondttrt«i:«ct«.l^t«»«e 
deUvered over actual time, were a. Ibllowr l.M9V.4«a»ec2.188V.4.67 
msec; a-lSSV. 4.24 msec. 4.200V; 4.26 nuee. 5-200V. 456 ».ee: S-ISSV. 4.26 
msec: 7-180V. 4.59 maer. MSSV. 4.57 asae: 9-201 V. 4.24 a»ec (A. noted in 
Example I. the differMwe in numb«" of peifonned eUctioporationa waa 
attributed to the need to achieve a atatiatieaUy aignlfie-it number of viable 
colonies for each of the three condition.. TCAB8AANBXla»dANEX 2: the 

amount of DNA used for e«:b etaetwporttion (26 |lg) th. sane tor «ch. and 
the same number of cdta wera eUetrepomtad. 

Tliere.fter. the dectwpomtion material was admi<«l with 20 ml of 0418 
Growth M«lia(CHO*SPMnmin«shyiwanthina and thymidine (Gibco. 

Grand Island. NT. Form Mo. 914)458PK»ndttding 50 mM h|pd««tMn» «id 8 
^Mthymidine). m admixture wa.genflyagiUt.d.fcnow.d by plating 200 m1 of 
the admixture per waU into 96.well plates, one plate fbr each electreporation 
(nine). Beginning on day 2 after eleetn.porati«n.th«wgh day 17. ISOjd of each 
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weU was ramoved. aad ISO vi of ficsh G41B Grawth Media wnt a iiriTig 400 iigrml 
G418 was added thtnto. Colonics wnc analynd on day 25. 

Om hundred aad twenty one (121) colonies expressed anti-CD20 antibody Ue . 13 
colonies per electroporation). Of these. 63 (62*) ezpreesad ow 250w/tal of 
protein; of the 63. 20 of the colonies (16.5*) expressed over I000)»gftnl of protein. 
Only 5 of the 121 colonies (4.1*) expressed less than 25>»g/ail of protein. Figure 
5 provides a histogram comparing expression of pwtwn per eolimiea derived from 
the vectors TCAE SX ANEX I and ANEX2. 

The foregoing data indicates that, inttr ofio. as between ANEX 1 and ANEX 2. 
the use of at least one out^f-frame start eadon uprtream of a fiiBy impaired 
consensus Kotak associated with the translatioB initiatien of a dominant 
selectable marker decreases the number of viaUe colenies expzesring co>linked 
gene product and signilieanay increases ibt amount of expressed ce4inked gene 
product. 

m IMPACT OP INSERTIW OP GENE PRODUCT OF INTEREST 
^a^OFAOOBIIKANTSEUCTABLEMABKEB 



Building further upon the ANEX 2 vector, an artificial splice was generated 
between amino add residues 51 and 52 of the NEO coding region l»f ANEX 2, 

25 followed by insertion therein of the anti-CD» encoding legieD. Two such vectors 
were generated: the fint, comprising a censeDSua Kexafc sequence for the NEO 
translation initiation codon aad not comprising an outrof-frame start eodon. is 
referred to as "GKNEOSWASF:" the second, eomprisiag a fiiBy impaired 
consensus Kosak and an outHtfftame start condon. is referred to as 

30 'NE0SPIA3F.' 
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Both GKNE0SPLA3F and NE0SPLA3P contaiB ttefoDowing axtUleial iniron 
sequ nee between taOant add wddue. 51 and S2 of NEO: 



y CIC WCfCTAAOTfiCaflCMCtikCTAAC (TOs CT(03 TCC (Tls C CTOCAOOAC OAO r 

Tha und erUned portion reprwtttt a seduMiee amenaMa to dl«a«tion with Not I 
enxyn.e;the«.eodiiigregionforant^CD20.lnttrol<a. waa iaaarted within this 

region. 

Although not wishing to be bound by any particular theoiy. the inventor 
postuUtM that duiing e«pre«aion. the induflon of a gene product of interest 

within an artifieial intionie inwrtion region of a dominant seketaUe marker (e.*. 
the NEO gene) should aignifieantly decreaae the nuinber of vWde colonies 
producing, in th. caae of the disclosed GKNEOSPASF and NE0SPLA3F vectors. 
anti.CD20 antibody. This is predicaud upon two pointa: fiiat. only those vectors 
which are able to transcribe mid eorrecti, spKea^t the mitibody encoding region 
and correeUy trmialate NEO will be C41B Rsietmit; s«»nd. because each 
antibody cassette has its own promoter and polyadanylatfon region, transcription 
and translation of the antibody is independent of translation of NEC. 

The GKNB0SPLA3P and NEOSPLASP veetora were eonstnieted in the foHowing 
manner 

Anti.CD20 in ANEX2was digested wiaiNotXandXhoIin order to isolate the 
1503 bp NBO cassette DNA fragment (see Pigure4betw«en "Not 1 7023" and 
-XhoIS52(naa£Wlow8: iO|dofant^CD20inAMBX2wns.d«ixedwith6ul 
deio«sedH/)rdHjO-):lplNotXen«yms(NEB.Prod.No. 189S):2plof lOX 
Not I digestion buffer CNEB; provided with enxyaa): mid 1 |i Xho I enjyme 
(Promega. Madison. WS.P«od.No.R4164). TWs digestion mixture was 
ineubatedovermghtatSrC. The resulting digested DNA was siaa fractionated 
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by 0.8« agawae gel elee«ph«re«i8 and the desired ftagmnt migratiiiK at 1503 
was isolated via the GIasaMAX~ method (Gibco BBL, Grand IsJand, NY. Prod. 
No. 15590-011) for iaeertien into pHueacript SK(.) plasnnd DNA (Straugene. La 
JeUa.CA). 

pBlueseript SK M was previously prepared for acceptance of the NEO casmu by 
double digestion with Not I and 5Qw I using the saae eonditiona as above for 
enU^D20 in ANEX 2. Digested pBluesocipt SK (-) was then collected by ethanoJ 
predpitatioo by the addition of 70 (J dHTQ; 2 pi tRNA (Sigma. St. Louis. MO. 
Prod. No. R^OS): 10 |il of 3M NaOAc and 300 gl 100^ BTOH (-20-C). ThU was 
followed by a 10 min spin (13.000 RPM). decanting the supernatant, rinsing with 
70* ETOH. decanting the Uquid. dryingin a SpeedVACW .ad resuspendingin 
20MlltTE. 

Ligation of the NEO cassette DNA fragment into preiwred pKueseipt SK (-) 
vector was accomplished as fUlows: 10 id ofNEO fragment DNA was admixed 
with 6 111 dHjO; 1 Id cut pBluesctipt SK(.) vector DNA: 2 m1 10 xUgation buffer 
(Promega, suppUed with enryme); and 1 |d T-4 DNA Ugase (Promega. Prod. No. 
M1801) foBowed by incubation at 14«C overnight Ugaled DNA waa collected by 
ethanol precipitation as deaenTwd above for the pm««tion of pKuesoipt SK ( ) 
vector DNA. 

Ten (10) Ml of the iMUspoided figated DNA was »«nslbrmed into E. eoli XL ! 
Blue™(Stiatagen«).fbll«A»g««nu«seturerinstnietions. Ten (10) bacterial 
colonies were inoeaUt«i to LB birth (Gibco BRU PnA No. M27950B) including 
ampicillin(S0|igtol;Sigm..P«od.No.A-9393). PUnrid. were isolated from the 
10 cultures with a Promega DNA puriUcation system (Prod. No. PR.A7100). 
following manufBctuiw inrtwetions: these plasmids may have comprised the 
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plasmid r.fomd to as BlueNEO* dn^ndingon the riffideney of th. foregoinB. 
(BlueNEO* was confinned d« to Uie suiBeieiiey of the foUovrtng proeedoro.) 

BiwNSO* cMvttUis • Mot I wrt»i«lioo roeognitloii saouMMt ntmnd upon 
5 HBation of th. NEO eassotu frtf««t DNA inte tho pBIUMertpt 8K (-) vector. 
Thlsslts w..de.troy«lbyth.lMtowi««: i ul of MutNEO* DNA w« admixed 
with 16 Ml dHaO: 2 ^ 10 X Not I digwtioo taffT <NEB): M Not I enzyme (NEB). 
Thl,w..foUow.dbyineuba«kw».t37«Cfcr2l«. TWs digested DNA was then 
puhf.edbyspincoluanfraetioiiatfon»«uIti«wlnl5tffiaalvpl«^^^ This 15^1 
10 NotIdig«rt.dDNAw..Tdont<nded"tyadai«gwith4MlSXKlenowbuffcr 
(20 »M Tris-HCL. pH 8.0. lOOmM MgCl«) and 1 yl DNA Polymerase I Large 
(Menow) Fn^mtnt (Pnm«ga. Prod. No. M2201). This admixture was incubated 
at mm tamperatmo for 30 minotaa, Blunt^ded DNA was then purified by 
spin column ftMtionation. giving a final volume of 15 vL 

Ligation of the blunt-ended DNA was performed in an analogous way as «. the 
ligation of the NEO cassette fragment DNA into the pBlueseript SK {•) vector 
except that the final DNA was resuspended in 17 nl of I XTE. 

Following ligation, the DNA was subjected to a second restriction digestion with 
Notlby mixing the 17 ylofDNA with 2 pi 10 X Not I digestion buffer and 1 pi 
Nonenzyme (NEB). Digestion was allowed to proceed at 37«C for 60 minutas. 
PoUowing digestion, the admixture was purified by spin column ftaetlonation 
resulting in 15 ytX final vdiime. 

Ten (10) Ml of the purified DNA was transfonnediato E. coU XH Bltte** 
(StraUgene).foUowingmanufacta«rinstruetlons. Ten (10) batt«rial colonies 
were inoculated in LB broth (Gibco BRL including ampiallin (50 iigfail: Sigma). 
Plasmids were isolated from the 10 cultures with a Promega DNA purification 
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system foUewinc aanuftctum insBuetioiis; theat pluaids may h»v« comprised 
the plasmid Toured to as BluoNEO- dopondiaf on tht suffieioney of the 
fortcoioff. (BIueNEO. «u confirmed due to the wiBcency of the (dOomag 
procedure.) 

BIueNEO- eontaias o unique Pst I re8tricti«m «iu spanning the codons for amino 
add residues 51 and 52. BIueNEO- was digested with Pst I as foUows: an 
adfflixtui« was formed containing 15 pi dHjO; 1 BIueNEO- DNA. 2 vi 
digestion buffer 3 (NEB) and 2 (d Pst I eniyme (NEB. Prod. No. 1403). This 
admixture was incubate at m for 3 hr». Digested DNA was then purified by 
spin column fractionatioii. The following synthetic oligonucleotide was then 
ligated to the Pst I cohesive ends of BIueNEO-: 

5- CCTAAPTOCG0CC«;tACIAACTCTCTCCT^ (SB9IDN0:W>and its 

complementary sequence: 

$t»AAAAAO0A0COACCACACA<mVUrPiUS00GCC(MACTIACaGCAr (8BQ ID MO: U). 
Insertion of this linker creates a consensus 6' spUee donor sita (by ligation) 
followed by a Not I siU, foUowed by a consensus spUee branch point, followed by 
a synthetic polypyrimidine tract. foUowed by a consensua S' spUee acceptor site, 
as indicated above. 

Ligation was performed as described ab«e fi»r the Ugation of the NED cassette 
into pBlueseript SK (-) eacept using 2 |il of Pst I linearised HueNEO- DNA and 
14 111 (176 pmoles) of annealed comptementary oBgonneieotidas. 

25 The foregoing (and folUwing) synthetic oligonueleBtideB were chemically 

synthesixed usiiig an Apptied Bioaystams 391 PCR MATE« DNA Synthesiser 
(Applied Biosystems. Poster <Sty. CA). AB leagenta fi»r the synthesis were 
purchased from Applied Biosystms. 
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Ligated DNA wu coUeeted by ethanol prwipitation as deserib d above Car the 
preparatioo fpBlaeseriptSK(-) vector DKA. 

Ten (10) vl of the resMpendwl Ugated DNA was ttaMfomed inta E. coli XLl 
5 Biua«» (Stfataiwia). ftlUwrtng aianolbetew toetfuotioaa. Ton <10) bacterial 
colonies war. inoculated ia LB broth (Gibco BRL) uidwBng ampidUin (50 ugfmU 
Sigma). Plaaarids waxa isolated flrom the 10 eulturaa with a Promega DNA 
puiiSeatioB system. foUowing manufiietunr instmetions; these plasmids may 
have comprised the plasmid tefened to as NEOSPLA and/or NEOSPLA- 
10 depending on the suffideacy of the fcregoing and the orientotion of the insertion 
of the (digonudeetides. 

Deurmination of orienUtion of the splice junction Unker was preformed by 
nucleic add sequendng using the Sequenase Version 2.0 DNA Sequenring Kit 
15 (United States Biochemical, aeveland, OH. Prod No. 70770) iMIowing 

manufacturer instructiona. Upon determination of linker orientation within six 
independent jdasmid isolates, identification of NEOSjPLA was made such that 
the inserted splice junction sequences are in the correct forward orientation with 
respea to the directien of NEO transcription. 
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NEOSPLA was digested with Xho I by forming an admixtiae of 15 ill dHjO; 1 pi 
NEOSPLA DNA; 2 10 X digestion buffer D (Pramega. suppUed with emymeh 
and 2 ^1 Xho I ensyme (Promega, Prod. No. R6l«l). TWa admixture was 
digested at 37«C for 3hps foOewed by DNA purifistion by spin eolnmn 
fractionation. Into this site was ligated a self eomplemantary synthetic 
oligonucleotide having the following sequence: srctuoumhr fflBQiDNam 
Insertion of this sequence eflbetively dauiges the JCho I site tb a Pac I restriction 
site (as underiined in SEQ ID NO: 14). 
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Ug»tiimwp«foni»dMcte8Cribedri»»efertl«Iig«tloa f ttw NEO cassette 
into pMueseript 8K (-) rnxmpt vmag 2 pi of Xto I Baeatteed NEOSPIA DNA and 
14 ill (176 pades) ©f aanealad eanpUBMitaiy oUgmudaotidaa. 

Ugated DNA was ealleeted by etbanol pmaiatation aa datoltoad abore for Iho 
preparatioo of pBluescript SK (-) vaetOT DNA. 

Ten (10) la of the resuipended ligated DNA wae tranafenaed into S. eoli XL-l 
BluoW (Stratagene). following manufectuxer inatnietioaa. Tan (10> bacterial 
colonies were inoculated in LB broth (Gibco BRL) indudinj ampidUin (50 us^inl: 
Sigma), nasinidswew isolated from tha 10 eultoias with a Piomega DNA 
purifieation STStem. following maau&eturerinstnietiima: thasa plaamids may 
have comprised the plasmid tefenad to as NEOSPLAS dapanding «n the 
suffidency of the foregoing. (NE(jsPLAS was cenfbaad dua to the auffieeney of 
the fdlowing procedura.) 

Ana^D20 in ANEX 2(01JD eontaint the antl<SD20 Ug^ dida and heavy dmin 
immunoglobulin caasettaa and a DHPB cassette bounded by a Not I «te at the 5' 
end and an Xho I dta at the 3- and. Anti-CD20 in ANEXaCl^ was digested 
with Xho I by fonmng an admistnn of ISuldHaO. 1 )il anti^M in ANEX 
2(G1 JD DNA. 2 Ml 10 X digastian bufifer D (Prameg^ supplied with enayme) and 
2MlXhoIentyma(Pr«meg^Prod.Na.B«6l). TIda admixture was digested at 
37«Clbr3hr8ibBowedhyDNApu«ifcationby8pin«diimnftnrtionation. Into 
this dta waa ligated a salf eamptanantary synthelie cligwudaotide of the 
following sequMMK rfoaMOaa££BCtT (mnwuixu Iniartianofthis 
sequence affbctiwiy dmag«s the Xho I site ta a Not I rastticlion dta (aa 

underlined in SEQ ID NO: 18). 
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WO 94^11^^ 

™«l M d«aibed «b»v. for the lifauon pf th. Ntu cas^etu. 
otpBlMKiipt SK (-(.W" DNA. 

„ With . ONA <^ >*>• 

foUowing procedure.) 

^. J •-*^«Mdi«ated«t37K;fcr 3 hrs followed by nte 
♦ 1 .-..-^fc.^. mj tha desired fragment 

Bugrating at 5516 ba«. p«i« by wa. isetated via tM « 
insertien into NEOSPLAS. 
^ J* «»M»««f the anti-CD20 cassette by 
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lOX Not I dieeatioa buffer <NEB); 1 m Not I enxyme (NEB): foUo^ed by 
incubation at 3r C for 2 h«. TWs digo6t«l DNA was th« purified by spin 
columo fraetionatioB i«8ttltin« in W 111 fiaal 



I vector was 



Ligauon of the aaU.CD20 DNA fragment into jweparad NE0SPLA3 1 
accomplished as follow.: iomofaati-CD20fr«««ntONAwa..dnn«d«.th 
6 ^idH20:McutNBOSn;A3vi»«ti,rDNA;2iaiO«ligationbttffer<Prom^ 
supplied with en«y«.>. «wi lul DNA ligass (Ptomsg.): feBawd by 
i«ubationatU'C««n»ght Iig.UdDllAwMColI«t.dtytha«l 
p,^pitationa.dMcrib«i.bav.(brth.prq«.tionofpBhu«»^ 

DNA. 



Ten (10) Id of the wauspended ligated DNA was wnsfenoad into E. coli XL. 1 
Blue«(Stnrtagena).fcIlowing««milhctumi««r«et^ Ten (10) bacterial 
15 colo«e.wer.inoculat«iiaI3b«th(OaKoBBUi«»dtrflng«npicaM(80M^^^ 
Sigma). Plasmidsw«rei«)UtadfifomthalO«ilti«swithaPromegaDNA 
pttTification system ftllowtog maaufi«*«m inrtruetSons: these plasmids ma, 
have comprised the plannid. rrfbrred to «i ^-CDTO in NE0SPLA3P and an ti- 
CD20 in NE0SPLA3R depending on th. wiffidency of the fo«going and relative 
orien»tipn of the inserted fragnwit with respect to NBO transcription. 
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Determination of orientation of the u»i^20 cassette insertion was preformed 
by doubledig..tio«withKi«I««ISpeI(HEB.Prod.No.l335)in NEB buffer I 
plus acetated BSA « ftBows: an •dmirt« comprisiag 4 ^1 DNA; 2 Ml NEB 
buffer 1; IMI Rpn I; 1 111 Spe I; 2 III BSA; and 10 Id dH# was fomed. The 
admixtare was dig«rt.d et 37^ for 2 hrs. followed by si« fractionation on an 
0^ agarose gel dectiopbofesto. Upon determination of anti-CD20 insert 
orientation within rix independent plasmid i«.l«f.. id«itifieation of anti.CD20 
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in NE0SPLA3P was aadB such that the inwrwd 8«|tt new ar in the forward 
orienution ^th raspeet to the direction of NEO traMcripti n. 

The 5S 15 hp antilCMO ftagmwit eoataini the SV40 oriciD. a chimeric mouu 
human iBmuno^ob«Bn light chain transcriptionai caaaatta. a chimeric mouse 
human immunogluhuKn heavy chain transcriptional caasette. and a murine 
dihydrafolate rwiuctase transcriptional cassette (see. Figan 4). 

Anti^D20 in NE0SPLA3P was douUy digened with I ««I Stu I by creatine 
10 theadmixtareoonriatingf 14,adHAlMl-nti-CD20iaNEOSPLA3F.2^1 10 

X digestion bufibr I (NEB. supplied with enryme). 2 Ml W « «cetyiated BSA (NEB 
suppBM with Kim I anajtne). 1 ul Kpn 1 ensyme. 1 Ml Stu I en*yme (NEB. Prod. 

Nos. I42S and 187S respectively). TWs admixture was digested at 37»C for 3 hrs 
followed hy rise fractionation by 0.8* egamse gd electrophoresis and the desired 
15 rragnttntiiiigr»tingat9368basepainibywasisoUtedvintheGlasaMAX 

method. 

A PCR fragment of DNA was generated from TCAE 5.2. The two following 
synthetic oUgonudeotide primera were utiliied in the PCR reaction: 



5 primer 5' (3CATCC fiJSlACCGGATCC ATCGAOCTA<?rAGCTTrC C 3" 
(SEQIDNOtieh 



3primer.5-CroACn'AfiamAGA<K:0<K;C<WA<nTACCT(W 
25 TCCAGQGC3'(SBQ1DN0:17) 

The underlined portion of SEQ ID NO: 16 represents a Kpn I site, and the 
underlined portion of SEQ ID NO: 17 represents a Sta I aite. 



40- 



Th. PCR produet was digested with Kpn I and Sto 1 and thm Ugated into 
prepared anti-CD20 in NE0SPLA3F. 

Ugaoanrftha687bpfra«ineatintopTWdaiiti4:DM 
aecoinpUahed aa folleva: 

2 111 anti.CD20 in NE0SPLA3P: 1 »il SDS; 1 |d tRNA (Sgina): 11 Jil 3M sodium 
acetate (pH 4 J) smn admiaed. Foltewtogpbanel/ehloroewra iaeaayl extraction 
of the adaiatura. the DNA «a8 pradpltatad ftam dia anueoua phaM by addition 
of 270 wl athand Oea^M) and this wat «pan «t 18J)00 tpm for 10 Bin. 

Fonowlaf a T0» EIOH wash, tha DKA w«i mosiwdsd in 16 |UTE . 10 Ml of 
PCR ft«g»tiit ONA was adnisad with 6 pi dHiO. 1 Ml cut ant»^20 in 
NE0SPLA3P vaetar ONA. 2 Ml W « bufliBr (Prtwnega. suppHad with 

easyme) and I Ml T-4 DNA UfBsa tPnimega) fUlowed by incubation at WC 
overnight Ligatad DNA was eoUeeted by ethanol pnwpitation as desoibed 
abflvafcrthapraparationafpBlttaaaiptSKWwtarDNA. 

Ten (10) Ml «f tba resuspended ligated DNA was trana&nned into E. eoli XL-1 
BlueW(Stratagene).fonowiBgmanufecturerin8tnietion8. Ten (10) bacterial 
colonies were inoculated in LB broth (Gibco BRL. Prod. No. M27960B) including 
ampieiUin (50 »gfaH: Sigma. Prod. No. A.9393). Plasmida we isolated from the 
10 cultures with a Promega DNA purification aystam (Prod. No. PR-A7100). 
Mowing maau&cturer instructions; these plaamids may have comprised the 
pUsmid referred to aa ant«n)20 in OKNBOSPLASP depending on the 
sufficiency of the foregoing. (Confirmation was based upon sequence 
determination of the diff«r«a ragioos of GKNE0SPLA3F vs. NE(JSPLAaF.) 



The new plasmid difibra fhnn anti^20 in NE0SPLA3P in ita Iteak sequence 
for th€ NEO gene which i«: 
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TOT GIT GOO AOC TPO QAl£fiAl ec Aee ATG Gtt 

Clal SturtNEO 

5 for Anti-CD20 in GKNE0SPLA3P. and 
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TCT G CCA nCATGG AGG AAlXlCAlccTcc ATG at 
up«»«u»8ta«t St«rtNEO 



30 



for Anti-CD20 ia NEOSPLASF. 



Comparati*. of f ^-CD^ in TCAB 5 vKtor (conpnsing 

IffiO ^th conaenwa Koiak): ANEX 2 vector (eompnaing N^^ 
impair^ Iteak. ««l «P««» o«t^-fr«ne rtart 8.<p»««ce); NE0PLA3F (aati - 
CD20 ia»rt«l via artifteid intrtmie ins«tiOB fgion batw«« anano adds 51 and 
20 52ofNEO:NEOha.fenyi«pairedKo»kanda»«p»t«a«o«t^-fr««»«n 
sequence): and GKNEOSPLASF (anti-CD20 inserted via artifldal Intrcnic 
i^ertion region between .««• add. 51 and 52 of NEC: NEO ha, ««««««, 
Kozak). 



Twenty-five (26) Mg of eadiplasnld(dige8ted as follow.: antf-dWO in TCAE5 
end ANEX2.NotI: ■nti-CD20 in NE0SPLA3F - Pad; anti-COaO in 
GKNEOSPLASF . Pae I and K|« D was electroporated Into 4 « 10« CHO cells; 
these digestioni weie utilised to separate the genes expressed in Bamnial>an 
cells from theDNA used togrowtheplaanidinbacteTia. PdUnnngdigertion. 
EtOH predpitation of the DNA, and diyiag thereof, the ONA was res«.p««ied in 
sterile TEataconcentrationofliig/UL Bleetioporation conditians iweie as 
described in Example H. except that 230 volts was utilised and. fcUowing 
electroporation. the mixture of ceUs and DNA wa. maintained fcr 10 min. at 
tempennize in the sterile, disposable electroporation c«y«tt*. 



room 
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Foil wing elefetropomtiOB. cells were plated into 96 weU dishe. as shown below i„ 
Table I. based upon the e«peetedfre<n«iicyofG4l8te«ista»te^^ «M(asderiv«j 

from preliminary experiments: data not shown): 



TABLE I 
COMPARATIVE EXPRESSION 



Plasmid 
TCAE5 
ANEX2 

GKNE0SPLA3F 
NEQSPLA3F 



Tn>«sf»ctions Call, Pitted Wo. 96 Well Plates 
1 4x10* 5 

I 2zl0< 

5 2«10^ 



5 
S 
5 



10 



TABLE 1 (contiimsd) 



Plasmid 



No. G418 Resistant Frequency of 0418 Resistant Colony 
Cnlnaies ' nar Transfectad Cell 



TCAES 
ANEX2 

GKNE0SPLA3F 
NEOSPLA3F 



16 
16 
16 
16 



1 in 20.000 
1 in 100.000 
1 in 100.000 
1 in 1.000.000 



(CeUs were fed with 0418 containing media on days 2. 5. 7. 9. 12. 14. 18. 22. 26. 
30 and 34; supernatant ftT>m colonies was assayed fer immunoglobulin 
15 productiona«d»hacolo«..became«onlluentinthewU«ondaysl8.22.26,30 

and 34). 
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Figures 7A to 7C provide hist gT« multo aiid •vidence the p~«igt of 
coloniofl at a partieolar level of expreaaica. 

Th. 8M.mplM pn»«iM hmJn •« n.t »« b. «n.t«.d » limited to th. .pwific 
5 vectors. fiillyi»I«ii»«» consensus KD«kBequeneM.doia^ 

marker.. traMCiiptlonal«...tte..«.d/oren.n-.od proteins. ThefbQy 
impaired ctmsenso. Kozak. and the utiHxation ther«,f. are not to be eanatrued 
limited to ANEX I and ANEX 2 vectors. Similarly, the preterad faUy impaired 

consensus Kstak sequences and vectors in no tray coastitato aD adudsaiett. 
10 eitheractu.lorimpUed.thatthe.ear.th.only««l»enc.«orv««or.towhichthe 
inventor is entitled. The inventor is «»titt.d to the Ml l««ltb of p«t*ectlon 
under appUcable patent laws. I^erT«iiraeto« Incorporating fallylmpair^l 
consensus Kozak sequences h«re b.«»id«itiaodlvthailIv«it«r o» ANEX 1 and 

ANEX2 for purposes of daiming these TactortVy*iigMtiiifP»««^ 
15 comprisingthesevector.and.««^I»Ow.,.dnK»itodwlth*h.A».h««TVPc 
Culture CoDeetionOVTCC). 12301 ParWawa Drive. RocWH.. Maryland. 20852. 
undertheprovi8ionsofth.Bttd.p..tTV«l«yfcrthefat«natiaMJR«»^^^^ 

theDepo8itofMicrt)or«ani»itethaIN»h«»«rfP»t«»»I^^^ 
plasmids were tostodby tht ATCC onN«»«ob«r9. 1992. and datoimined to be 
20 viabl.anth.td.to.Th.ATCCh...Mlgn.ath..aia-odd.th.fcll«wi^ 

deposit nu«b«.e9l20(«Ui^inTCABl2(AHEX I)) «de9ll8(«.ti.CD20 

in ANEX 2 (OUOX fbr W*- of thi. dop-lt. P»«Hdd. 

into£Leoii. 

25 Althottghthai«»«rtio«l«b«nd«saibadtoc««d.rdaadrtiawi^ 
certain a-dKrfimwitt th.r«« oth« a-bodte-Bto ^th^ 

th. uaehing. of th. praaent invention are posiibl.. Accordingly, ndther the 
di«ao««««»th.«toi««tef6now.«.i«tond.d.«oral»ouW 
Umittd by tb. d««riptions of th. prrfk«.«i e«b«li«e«to eo«toi««l her.. 
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SEQUENCE LISTINO 
(1) GENERAL DlFOBMAnON 

(i) APPLTCAMT: lUff. Ifitdidl E: 
(H) Tmi OP INVENTION: taj«ngD«^ 

StraUgiu for Enhancement of . 
BxpmrioBofGenBProdaetand 
Expression Vector Systems Compnsing 

Same 

(lU) NUMBER OP SEQUENCES: 17 
(iv) CORRESPONDING ADDRESS: 

fA) ADDRESSEE: IDEC Phamaeeuticals Corporation 
(B) STREETi iiOUToaejrannRoad 

(O CITY: §^25?® 

(D> STATE: CjJifcniia 

(E) COUNTRY: USA 

(F) ZIP: 92121 

25 (V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3^ indu 1.44 Mb 

(B) COMPUTER: Madntosh 

C) OPERATING SYSTEM: MSJKJS 
30 (D) SOFTWARE: lyBcro8oftWordS.O 

(vi CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 
3S (B) FILING DATE: 

(C) . CLASSmCATION: 

(viii) ATTORNEWAGENTINPORMATION: 

40 CA) NAME: Biirgoon,RdutrdP.Jr. 

ffl> REGISTRATION WJMBER: 34J87 
(C) REFERENCE/DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 



45 



(A) TELEPHONE: (619) 650-8500 

(B) TELEFAX: (619)5508750 



(2) INFORMATION FOR SEQID NO: I: 

(i) SEQUENCE CHARACTERISTICS: 

* (A) . LENGTH: 17 base* 

(B) TYPE: nodeicadd , 

(O STltAMDEDNBSS: eiBCl« 

(D) TOPOLOGY: linear 

^° (ii) MOLECULE TYPE: DNA(gBBOinie) 

(iii) HYPOTHETICAL: 7*» 

(iv) ANTI-SENSE: y«* 

(ix) SEQUENCE DESCRIPTION: SEQID NO: L- 
TAG CTA GGT CCT ACC CC 
13) INFORMATION FOR SEQ ID NO: 2: 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 17b««M 

(B) TYM!: nudmeadd 

(C) STRANDEDNESS: nngle 

(D) TOPOLOGY: linear 

30 (u) MOLECULE TYPE: DNACgenaniic) 

(iii) HYPOTHETICAL: yea 

(iv) ANTI-SENSE: no 

^^ («) SEQUENCE OESCRlPnON: SEQ ID NO: 2: 

ATC GAT CCT CGA TGC GC 
40 (4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISnCS: 

(A) LENOT: 9 bam 

4S (B) TYPE: nttd«e arid 

(C) SriRANDEDNESS: amgle 

(D) TOPOLOGY: Bnear 

(ii) KTOIXCULBTYPE: DNA (genomic) 
^° (iii) HYPOTHETICAL: yea 

(iv) ANTI-SENSE: no 
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(») SEQUENCE DESCRIPTION: SEQIDNO: 3: 

TNN ATG CTT . ^ 

^ (5) INFORMATION FOR SEQ ID NO: 4: 

(l) SEQUENCE CHARACraUSTIOa: 

IQ (A) LENGTH: 9baws 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single, mdudmg mck 
©) TOPOLOGY: linear 

15 (ii) MOI^COLETTPE: (DNA (genomic) 

(iii) HYPOTHETICAL: yee 

(iv) ANTI-SENSE: no 

(jd) SEQUENCE DESCRIPTION: SEQIDNO: 4: 

CNN ATG CTT ^ 

25 (6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 basee 

30 (B) TYPE: nttdeicadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA (genomic) 

(ii) HYPOTHETICAL: yes 
(iv) ANTI-SENSE: no 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

THH ATG TTT ^ 



35 



47. 



(7) 



30 



3S 



40 



INFORMATION FOR SEQ ID NO: 6: 



(I) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 9 bases 

® (B) TYPE: nndeicadd 

(C) STRANDEDNBSS: singl* 

<0> TOP(».OOyt Umw 

10 (U) MOLECULETYPE: DNA(««0Bie) 

(U> HYPOTKBTICAL: yw 

(iv) ANTI-SENSE: n» 

(«) SEQUENCEDESCRIPTI0M:SBQ1DN0:6: 

CNN ATG TTT 
20 (8) INFORMATION FOR SEQ JD NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291»iMt» 

2S (B) TYPE: nudeicawl 

(C) STRANDEDNBSS: siogle 

b> TOPOLOGY: liam 

(a) MOLECULE TYPE: DNA(ginamle) 

(ii) HYPOTHETICAL: yw 
(iv) ANn«SENSE: M 

(a) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



(9) 



TTC CCA 6CT TGG AtC CAT 
CCA CCA TGG TT 

INFORMATION FOR SEQ m NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 29ba8M 

(B) TYPE: BodMcan* 

(C) STRANDEDNBSS: single 
(D> TOPOLOGY: Baear 

50 (U) MOLECULE TYPE: ONA(g«n«ie) 
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(ii) HYPOTHETICAL: no 

(iv) ANTI-SENSE: no 

5 (a) SEQUENCE DESCBIPTION: SEQIDNO: 8: 

TTC GGA CCT ICC ATC CAT 
ecT eoA Toc w 

10 (10) INFORMATION FOR SEQIDNO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bam 

15 (B) TYPE: oudeicacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linoar 



18 

a.1 
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(ii) MOLECULE TYPE: DNA (genomic) 
(ii) HYPOTHETICAL: no 
(iv) ANTI-SENSE: no 
26 (sd) SEQUENCE DESCRIPTION: SEQIDNO: 9: 

CCA GCA TOG AG6 AXT CGA TCC 21 

Tcc ATG err ' 
30 (11) INFORMATION FOR SEQIDNO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 bases 

35 (B) TYPE: nucleic add 

(C) STRANDEDN^: sin^e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ii) HYPOTHETIC^ yw 

(iv) ANTI-SENSE: yea 

45 (a) SEQUENCE DKCRIPTION: SEQ ID NO: 10: 

6GA CCA TCC ATT CCT CCA TCC TCC U 
CAC AAC TAT CTC AGA ACC AAA TOT GAfi C 28 

50 



40 
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(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

. (A) LENGIB: ISbaaes 

' (Bi TYPE: Nudeie add 

(C) STBANDEDNESS: single 

an TOPOLOGY: UiiMT 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(ii) HYPOTHETICAL: no 
(jv) ANTI-SENSE: no 

W) SEQUENCE DESCRIPTION: SEQ ID NO: XI: 
CTG GGG etc GAG CTT TGC 
20 (13) INFOSB4ATIONFORSEQEDNO:12 
(i> SEQXffiNCE CHARACTERISTICS: 

(A) LENGTH: 47ba»« 

oc (B) TYPE: Nucleic aad 

(C) STRANDEDNESS: nasle 

(D) TOPOLOGY: toesr 

(a> MOLECULE TYPE: DNA^enoBie) 
(U) HYPOIHETICAL: no 
(iv) ANTI-SENSE: iw62 



30 



35 



40 



(a) SEQUENCEOESCRIPnONiSBQIDNO: 12: 

GGT AAG TGC GGC CGC TAC TAA CTC TCT CCT 30 
CCC TCC TTT TTC CTG 6A 

(14) INFORMATION FOR SEQ ID NO: 13 
(i) SEQUENCE CHARACTERISnCS: 



(A) LENGTH: 47 bases 

(B) TYPE: NttdMcadd ^ 

(C) STBANDEDNSSS: single 

(D) TOPOLOGY: Uniar 

50 (ii) MOLECULE TYPE: DNA (genome) 

(a> HYPOTHETICAL: no 
(iv) ANTI-SENSE: yes 
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(a) SEQUENCE DESCBIPnON: SEQIDNO: 13: 

CCA AAA AGO ACS GAG GAG AGA GTt AGT AGC G6C 33 

CCC ACT TAC CTG CA 
(15) INFORMATION FOB SEQ ID NO: U 
(I) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: Abases 

(B) TYPE: Nodeieado 

(C) STRANDEDNESS: sin^e 

(D) TOPOLOGY: linear 

" (u) MOLECXJLETVPE: DN A <««ioniie) 

(li) HYPOTBETTCAL: no 

20 (iv) ANTI-SENSE: no 

(il) SEQUENCE OESCBlPnON: SEQIDNO: 14: 

12 

TC6 ATT AAT TAA 
^ (16) INFORMATION FOR SEQIDNO: 18 
(i) SEQUENCE CHARACTERlSnCS: 

30 (A) LENGTH: MbasM 

(B) TYPE: NuddeMd 

(C) STRANDEDNESS: sia^ 
(D> TOPOLOGY: linear 

35 (il) MOLECULE TYPE: DVAig^acme) 

(S) HYPUTUETICAL: no 

(iv) ANTI-SENSE: no 

(a) SEQUENCE DESCRIPTION: SEQIDNO: 15: 



40 



TC6 AAG CG6 CCC CT 



45 (17) INFORMATION FOB SEQ ID Na 16 

(i) SEQUENCE CHARACTEBISTICS: 

(A) LENGTH: 37 bases 

so (B) TYPE: Nodeicadd 

(C) STRANDEDNESS: singla 

CD) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNACgenonic) 

41- 



(jj) HYPOTHETICAL: W 
(iv) ANTI-SENSE: no 

(x» SEQIIENCB DESCRIPTION: SEQ ID NO: 16 

OCA T6C «6T *CC GGA TCC ATC CA6 CTA 
CTA GCT TT6 C 

(18) INFORMATION PGR SEQ ID NO: 17 
(i) SEQUENCE CHARACTERISnCS: 

(A) LENGTH: 47b»m 

(B) TYPE: Nttddc »od 

(C) STRANDEDNESS: tngis 

(D) TOPOLOGY: linw 

(ii) MOLECULE TYPE: DNACgMomte) 

(a) HYPOTHETICAL: no 

(iv) ANTI-SENSE: no 

(3d) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

CTS ACT AGO CCT AOA CCC GCC CCA CTT ACC 
?S ACT T» TCC AGG GC 
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WhatttdakMdia: 

recombinant deoxyiibenudwc add techiuques, said vector conpriaii* at lean 
one dominant aelectable marker, whemin the tranataticm initiatiim start site 
said marker compiisea the fottowing sequence: . 

10 -3 *l 

Pyxx AtG Pyxx 

where "Py" i» a pyrimidine nucleotide; V.U a nudeotide; and the numerical 
designationa are relative to Ae codon "ATG". 

15 

2. The eipraaaioii vector of daim I wherein the nudeic add sequence 
encoding far the pwtmn of intereet ia o>4iakad to aaid dominant seleetable 
marker. 

3. The expresaion vector ofdaiml wherein said dominant selecublc 
marker is selected ftom the group consisting oft herpes simplex virus thymidine 
kinase, adenosine deaminase, asparagine synthetase. Salmonella his D gene, 
xanthine guanine phoephoiibosyl tranafetaae. hygremydn B phosphotransferase, 
and neomycin ^esphetnaaferase. 

4. The eipression vector of daim 1 wheiein said transUtion initiation 
start site sequence is selected from the group eonsiating of TtxATGCxx; 
CxxATCCxx: CxxATOTrt; and TiiATCTTa. where "x" is a nudeotide. with the 
proviso that the eodon-W downstream ofthe ATG codondoea not encode a 

30 stop oodon. 



20 



25 
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5. Tht tsprcsai a vector ofclaia I whaiwn said trtnsUtfon initiaUon 
start aita swjumea is TtaATGC«. whe» V i» • nudwtid.. 

6. Tka«i«ari«v«torofdaimlwlimm»aidt^ 
5 start 8iU8«|a«ii»UTCCATQCTT. 

7. The expression vector of daim 1 whtmn said translation initiation 
start siU sequanee is located within a secondaiy struewre. 
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8. Tb» «ci«s8ion vector of daia I wheiein said translation initiation 
start site sequence farther comprises at least one outHtf-fr-ne start codon within 
about 1000 nactootidea rfthe ATO start codon of .«d start site, with the proviso 
that no in.ft«n. stop codon is located within said 1000 nudeolidee. 

9. TlUaipiesaioB vector ofclaiml wherein said translation initiation 
.tart sita sequence further comprises at least one out-of-frame start «don within 
aboutaSO nudeotide. of the ATG start codon of «rid start site, with the proviso 
that no in.ft«ne stop eodon is located within said 350 nucleotides. 

20 10. Theexpressionvectorofd««lwhereinseidtranstotioni«itUuon 

start site sequence further comprises at least one outK,f.frama start codon within 
about50nudeotidesoftheATGstattcodonofBaid«tart8ite.witbtbeproviso 

that no in-ft«ne atop codon is located within said 50 nudeotidea. 



25 



11. The expression vector of daims 8. 9 and 10 wherein 
frame start codon U part of a consensus KoMk sequence. 
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12, The expwsaon vecwr «f daim 10 wherein said out-of-frame start 
codon and said taaslatien initiation atari slU sequence are both indud d as 
part of a seeondaTy stroctare. 

5 13. The expression vector of daims a, 9, 10 wherein said translaUon 

initiation start site sequence is part of a seeondary strucnwe and said out-of- 
frame start codon is not part of said secondary structure. 

14. A dominant sdeetaUe marker encoded by a nudeie add sequence. 
10 wherein the translation initiation start site of said dominant sdectable marker is 
selected from the group consisting of TncATGCxi; CxxATGCxx; CxxATGTxx; 
and TwcATGTax, where Y is a nudeotide. with the proviso that Tte" 
downstream of the ATG codon does not encode a stop codon. 

15 15. The material of claim 14 i^rein said dominant selectable marker 

is selected from the group consisting of heipes simplex virus thymidine kinase, 
adenosine deaminase, asparagine synthetase. Salmondla his D gene, xanthine 
guanine phosphoriboe^ transferase, hygromydn B phosphotransferase, and 
neomycin phosphotransferase. 

20 

16. The material of daim 14 wherein said translation initiation start 
site sequence is TxsAIQCae, where Y it a nudeotide. 

17. The material of daim 14 wherein said transUtion initiation start 
25 site sequence is TCCATGCTT. 
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18. The munial of d«i« I* vrt»emn said translation initiation start 
Mte se«tt*n« is loeatad withia • sMoadaiy structure. 

19. The mataiial of claim 14 wherein said translation initiation start 
5 site sequence farther comprises at least •neoBtHif.frMMttMteodonwitWn 

about 1000 nucleotides oftheATG start eodoaof said wart rite, with the proviso 
that no in.fra«e stop eodon is located wilUD seid WOO nucleotides. 

20. TiiemaierialofclaimUwheteiftBaidtraiMtotionliiitlstlonstart 
10 site sequence farther cwaprisei at toarteneoutof-fraiitt start eodon within 

about 350 nodeotidea of the ATG st^ eodon of said start site, with the imrrieo 
that no in-fraine stop eodon is loeatad within said 380 sudeotidaa. 

21. -nwniatarialofclaim 14 wherein aaid translation initiation start 
15 site sequence farthercomprieeeatleartoneout^-fraaastarteodon within 

about 50 nudaotidea of the ATG start eodon of said atart sit^ with the proviso 
that no in-frame stop eoden ia located within said 50 nodeotidea. 

22. Hwrnntarialafdaima 19. 20 and 21 wherein aaidoutHif-frarae start 
20 codonispartofaeaaaenansKosakseqnenea. 

23. ibematwtalafdalm 21 wherein said ouKf-frame start codon and 
said trmslation initiatiaa start aite sequenee an both induded as part of a 
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secondary structaia. 

24. The mataiial ofdaiaa 19, 20. 21 wherein said translation imtiation 
startdtaaequancais part of a seeoadaryatructure and said outHrf-fraae start 
eodon is not part of said seeandaiy itnctttia. 
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(iaeludtd withiii AoMrifltt Tyi» Cuto« CdD-rttw d«pofit miater 69120) and 
ANBX 2 Ottdodad wltWn ATCC dtposlt auaber fiSllBX 

26. AplMBdd«wilw*.taft»««l»~^»««"'*'*^^****^^ 
nuddc add .«|u«a« «epdiiw te ..id im»td» of interest i. cUated to «iid 
demiaant aeleckaUe marker. 

27. The plaamidofdaim 26 integrated within the ceUdar 
dewtytibeaueleie add of • mamaaHan host cdl. 

28. The plaemidofdaia 27 wherein laid hoatcrilia selected from the 
group coneUting of DG44. DXBll. CVl. COa R1610. SP2». P3«833.Ag8.653, 
BPA-lclBPT, RAJI, and 293. 

29. Theexpre.Monwetorofdaialfuithercoaii«isintanarti6d 

intronic inwrtion region within said de»in«>t sdeetabl. mailnr. wherein ai. 
encoding sein-nc. for a protein of i-t«e.t i. leeated withia ..id inee^ 



20 30. TliadoninantselectaUeaarkerofdaiinMtethereomprismgan 

artificial tnatronie insertian TCgioB. 



•67. 
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Neomycin phosphotransferase gene 

Cla I -3 

TCAE 5.2 TTGGGAGCTTGG AKGAI CC A cc ATG i>.H 

Mel vol 

ANEX 1 TTGGGAGCTTGG ATCGAT CC I cc ATG .Cltt 

Met Leu 

ANEX 2 CCaGCATGgAGGA ATCGAT CC 1 cc ATG Ctt 

Mel Leu 



FIG. 1 
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TCAE 5.2 vs ANEX' l(TCAE12) 
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TCAE 5.2 vs ANEX 1 vs ANEX 2 
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tCAE 5 vs necj:pla 
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